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How to Use This Book 


Organization 


The first chapter of this text is a review of functions. Students who have studied our Introduction to Algebra and 
Intermediate Algebra textbooks can safely skim this material. Students who are less familiar with functions should 
study this chapter closely. 


The rest of the book is divided into three parts. Chapters 2 through 5 are a thorough exploration of trigonometric 
functions, trigonometric identities, and their application to Euclidean and analytic geometry. Chapters 6 through 
8 cover complex numbers and their relationships to trigonometry and geometry. Chapters 9 through 12 offer an 
introduction to vectors and matrices in two and three dimensions, and in chapter 13 we apply vectors to Euclidean 
geometry. 
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We believe that the best way to learn mathematics is by solving problems. Lots and lots of problem$"in fact, we 
believe that the best way to learn mathematics is to try to solve problems that you don’t know how to do. When 
you discover something on your own, you'll understand it much better than if someone had just told you. 


Most of the sections of this book begin with several problems. The solutions to these problems will be covered 
in the text, but try to solve the problems before reading the section. If you can’t solve some of the problems, that’s 
OK, because they will all be fully solved as you read the section. Even if you can solve all of the problems, it’s still 
important to read the section, both to make sure that your solutions are correct, and also because you may find 
that the book’s solutions are simpler or easier to understand than your own. 
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Explanation of Icons 


Throughout the book, you will see various shaded boxes and icons. 


Concept: This will be a general problem-solving technique or strategy. These are the “keys” 
O] to becoming a better problem solver! 


Important: This will be something important that you should learn. It might be a formula, 
Y a solution technique, or a caution. 


: a 


Sidenote: This box will contain material that, although interesting, is not part of the main, 
| A material of the text. It’s OK to skip over these boxes, but if you read them, you 
| might learn something interesting! 


| Bogus Solution: Just like the impossible cube shown to the left, there’s something ag 
| with any “solution” that appears in this box. 
| n s 
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Exercises, Review Problems, and Challenge Problems 


Most sections end with several Exercises. These will test your understanding of the material that was covered in 
the section. You should try to solve all of the exercises. Exercises marked with a x are more difficult. 


Each chapter concludes with many Review Problems and Challenge Problems. The Review Problems test 
your basic understanding of the material covered in the chapter. Your goal should be to solve most or all of 
the Review Problems for every chapter—if you're unable to do this, it means that you haven't yet mastered the - 
material, and you should probably go back and read the chapter again. 


The Challenge Problems are generally more difficult than the other problems in the book, and will really test 
your mastery of the material. Some of them are very hard—the hardest ones are marked with a x. Don’t expect to 
be able to solve all of the Challenge Problems on your first try—these are difficult problems even for experienced 
problem solvers. If you are able to solve a large number of Challenge Problems, then congratulations, you are on 
your way to becoming an expert problem solver! 


Hints 


Many problems come with one or more hints. You can look up the hints in the Hints section in the back of the 
book. The hints are numbered in random order, so that when you're looking up a hint to a problem you don’t 
accidentally glance at the hint to the next problem at the same time. 


It is very important that you first try to solve each problem without resorting to the hints. Only after you've 
seriously thought about a problem and are stuck should you seek a hint. Also, for problems which have multiple 
hints, use the hints one at a time; don’t go to the second hint until you’ve thought about the first one. 
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Solutions 


The solutions to all of the Exercises, Review Problems, and Challenge Problems are in the separate solutions book. 
If you are using this textbook in a regular school class, then your teacher may decide not to make this solutions 
book available to you, and instead present the solutions him/herself. However, if you are using this book to learn 
on your own, then you probably have a copy of the solutions book, in which case there are some very important 
things to keep in mind: 


1. Make a serious attempt to solve each problem before looking at the solution. Don’t use the solutions book 
as a crutch to avoid really thinking about the problem first. You should think hard about a problem before 
deciding to look at the solution. On the other hard, after you've thought hard about a problem, don’t feel 
bad about looking at the solution if you're really stuck. 


2. After you solve a problem, it’s usually a good idea to read the solution, even if you think you know how to 
solve the problem. The solutions book might show you a quicker or more concise way to solve the problem, 
or it might have a completely different solution method than yours. 


3. If you have to look at the solution in order to solve a problem, make a note of that problem. Come back to it 
in a week or two to make sure that you are able to solve it on your own, without resorting to the solution. 


Resources 


Here are some other recommended resources for you to further pursue your study of mathematics: 


e Other Art of Problem Solving textbooks. This series of texts forms a full mathematics curriculum for 
high-performing students in grades 7-12. 


e The Art of Problem Solving books, by Sandor Lehoczky and Richard Rusczyk. Whereas the book that you’re 
reading right now will go into great detail of a narrow set of subjects, the Art of Problem Solving books cover 
a wide range of problem solving topics across many different areas of mathematics. 


e The www. artofproblemsolving.com website. The publishers of this book are also the webmasters of the Art 
of Problem Solving website, which contains many resources for students: 
- a discussion forum 
— online classes 
— resource lists of books, contests, and other websites 
— a PIRX tutorial 
— a math and problem solving Wiki 
— and much more! 
e Youcanhone your problem solving skills (and perhaps win prizes!) by participating in various math contests. 


More details about some of these contests are on page vii, and links to these and many other contests are 
available on the Art of Problem Solving website. 


A Note to Teachers 


We believe students learn best when they are challenged with hard problems that at first they may not know how 
to do. This is the motivating philosophy behind this book. 


Rather than first introducing new material and then giving students exercises, we present problems at the 
start of each section that students should try to solve before new material is presented. The goal is to get students 
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to discover the new material on their own. Often, complicated problems are broken into smaller parts, so that 
students can discover new techniques one piece at a time. After the problems, new material is formally presented 
in the text, and full solutions to each problem are explained, along with problem-solving strategies. 


We hope that teachers will find that their stronger students will discover most of the material in this book on 
their own by working through the problems. Other students may learn better from a more traditional approach 
of first seeing the new material, then working the problems. Teachers have the flexibility to use either approach 
when teaching from this book. 


Sections marked with a * contain supplementary material that may be safely skipped. In general, chapters 
are not equal in length, so different chapters may take different amounts of classroom time. 


Links 


The Art of Problem Solving website has a page containing links to websites with content relating to material in 
this book, as well as an errata list for the book. This page can be found at: 
http://www.artofproblemsolving.com/BookLinks/Precalculus/links.php 


Extra! Occasionally, you'll see a box like this at the bottom of a page. This is an “Extra!” and might bea’ 
(mp >it quote, Some biographical or historical background, or perhaps an interesting idea to think about. 
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Contests 


We would like to thank the following contests for allowing us to use a wide selection of their problems in this 
book: 


e The American Mathematics Competitions, a series of contests for U.S. middle and high school students. The 
AMC 8, AMC 10, and AMC 12 contests are multiple-choice tests that are taken by over 400,000 students every 
year. Top scorers on the AMC 10 and AMC 12 are invited to take the American Invitational Mathematics 
Examination (AIME), which is a more difficult, short-answer contest. Approximately 10,000 students every 
year participate in the AIME. Then, based on the results of the AMC and AIME contests, about 500 students 
are invited to participate in the USA Mathematical Olympiad (USAMO), a 2-day, 9-hour examination 
in which each student must show all of his or her work. Results from the USAMO are used to invite a 
number of students to the Math Olympiad Summer Program, at which the U.S. team for the International 
Mathematical Olympiad (IMO) is chosen. More information about the AMC contests can be found on the 
AMC website at www.unl.edu/amc. (Note: Problems marked AHSME are from the American High School 
Math Exam, the predecessor of the AMC 12.) 


The Mandelbrot Competition, which was founded in 1990 by Sandor Lehoczky, Richard Rusczyk, ané:sam 
Vandervelde. The aim of the Mandelbrot Competition is to provide a challenging, engaging math: .natical 
experience that is both competitive and educational. Students compete both as individuals anc. ın teams. 
The Mandelbrot Competition is offered at the national level for more advanced students and at the regional 
level for less experienced problem solvers. More information can be found at www.mandelbrot.org. 


The USA Mathematical Talent Search (USAMTS), which was founded in 1989 by Professor George 
Berzsenyi. The USAMTS is a free mathematics competition open to all United States middle and high 
school students. As opposed to most mathematics competitions, the USAMTS allows students a full month 
to work out their solutions. Carefully written justifications are required for each problem. More information 
is available at www.usamts .org. 


e The American Regions Math League (ARML), which was founded in 1976. The annual ARML competition 
brings together nearly 2,000 of the nation’s finest students. They meet, compete against, and socialize with 
one another, forming friendships and sharpening their mathematical skills. The contest is written for high 
school students, although some exceptional junior high students attend each year. The competition consists 
of several events, which include a team round, a power question (in which a team solves proof-oriented 
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questions), an individual round, and two relay rounds. More information is available at www.arml.com. 
Some questions from the New York State Math League (NYSML) also appear in this text. NYSML is a New 
York statewide competition that inspired the start of ARML. 


e The Harvard-MIT Mathematics Tournament (HMMT), which is an annual math tournament for high school 
students, held at MIT and at Harvard in alternating years. It is run exclusively by MIT and Harvard students, 
most of whom themselves participated in math contests in high school. More information is available at 
web.mit.edu/hmmt/. 


Some other sources for problems in the book are listed below: 


e Mathematical journals Mathematics & Informatics Quarterly (M&IQ), Mathematical Mayhem, and Crux Mathe- 
maticorum. 


e Various Canadian national competitions, such as the Canadian Invitational Mathematics Contest (CIMC), 
the Canadian Open Mathematics Challenge (COMC), the Canadian Mathematics Olympiad (CMO), and 
contests run by the Centre for Education in Mathematics and Computing (CEMC). 


e Problems from various countries’ national math contests. These problems are cited by country name in the 
text. 


How We Wrote This Book 


This book was written using the TEX document processing system. We thank the authors of the various ISTEX 
packages that we used while preparing this book, and also the authors of The 4TeX Companion for writing a 
reference book that is not only thorough but also very readable. The diagrams were prepared using METAPOST 
and Asymptote. 


About Us 


This book is a collaborative effort of the staff of Art of Problem Solving. Richard Rusczyk was the lead author of 
the text and primary editor of the solutions manual. Naoki Sato was the lead author of the solutions manual, and 
selected and organized many of the problems in the text. David Patrick and Jeremy Copeland read several drafts 
oś the text and solutions and made many, many helpful suggestions. Mathew Crawford and Simon Rubinstein- 
Saldo collected problems for other Art of Problem Solving books and for Art of Problem Solving classes that are 
included in this text. The following people also made contributions through proofreading, suggesting material 
for the textbook, or writing problems for the text: Joshua Brakensiek, James Choi, Billy Dorminy, Gerhardt Hinkle, 
Patrick Hulin, Vincent Le, Jeff Manning, Aaditya Ramesh, Max Rosett, Sam Ruth, Aaron Schild, Justin Venezuela, 
and Valentin Vornicu. Vanessa Rusczyk designed the cover and the chapter heading images. 


Websites & Errata 


We used several websites as source material for the text. Links to these sites and errata for the text and solutions 
are provided at 


http://www.artofproblemsolving.com/BookLinks/Precalculus/links. php 
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Extra! 
a Lm | HEB 


Consider the function 


O 


If we input 0 to this function, we have f(0) = 0? = 2i. Now, suppose we take this output and put 
it back into f. We then have 


FEO = O = (21° + 27 = —4 + 23. 


Similarly, we have 


FO) = FFFO)) 
= f(—4 + 21) 
= (-4 + 2) + 2i 
= (—4)* + 2(—4)(2i) + (2i)? + 2i 
= 12 — 14i. 
If we keep going, computing f*(0), f’ (0), and so on, the magnitudes of the results just get larger 


and larger. However, if we change our starting function just a little bit, we have a very different 


result when we compute f"(0) for larger and larger values of n. For example, suppose we start 
with the function 


Eo 
f(z) =z" + 5 
Now, if we compute f(0), £7(0), °(0), and so on, we have 


f0) = 7 
o= s(i) -iei 
A 


f2 (0) = —0.1324 + 0.3883i, 


f2 (0) = —0.1359 + 0.3930i. 


If we keep going and going, we'll find that our results stabilize somewhere around —0.136 + 
0.3931. The magnitudes of the results don’t just keep growing and growing forever! This 
observation was one of the early breakthroughs in the a of fractals. 

The only Ta between the functions f(z) = z? + 2i and f(z) = z* + 4 is the constant that 
is added to z° in the function definition. This small change makes a vey large difference when 
we repeatedly compar the function with itself. Of course, 2i and 5 aren’t the only numbers 
we can gue to z? to form our tunetion f(z). With such a big difference between the behavior of 
f(2) = 2 + 2iand f(z) = z + į, we should wonder what happens when we add other complex 
numbers to 2”. : 


Continued on the following page. 


—————————— 


Extra! 


We can create a graph based on our results when we try adding different numbers to z* in 


wb our function definition. We consider all functions of the form f(z) = 27 +c, where c is a constant 


complex number. For each value of c, we color the point on the complex plane that represents c 
black or white. If there’s no limit to how large the magnitude of f"(0) gets as n gets larger and 
larger, then we color the point white. So, our work with z? + 2i tells us that the point represented 
by c = 2i is colored white. If there is a limit to how large the magnitude of f"(0) gets as n gets 


larger and larger, then we color the point that represents c black. So, the point that represents 
c = 5 is colored black. 


When we do this coloring for every complex number c on the complex plane, we have a 
graph that is the geometric depiction of the Mandelbrot set, shown above. Unfortunately, low- 
resolution black-and-white printing cannot do the Mandelbrot set justice. Visit the links page 
cited on page viii to find links to some fascinating pictures of the Mandelbrot set. If, instead of 
using white, we color points based on how quickly f"(0) grows as n grows, the result is quite 
beautiful. Also, some of these websites contain interactive applets that allow you to zoom in on 
parts of the Mandelbrot set. With these applets, you'll see some startling images. You'll get a 
small preview of these images in the chapter headings of this book. The final image in the series 
of chapter headings is zoomed-in by a factor of over 1,000,000! As you'll see, as we zoom in, we 
frequently find features of the Mandelbrot set that are very similar to the original Mandelbrot set! 
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The chief function of the body is to carry the brain around. — Thomas Edison 
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Functions Review 


1.1 Function Basics 


Suppose we have a machine that accepts any number, multiplies it by two, adds three to this product, and then 
outputs the result. Mathematically speaking, this machine is a function because there is only one possible output 
from the machine for each input to the machine. We can give this function a label, f, and write the function as 


f(x) = 2x +3. 


This simple equation describes the machine. The “(x)” after f on the left side indicates that we are putting x into 
the function f. When speaking, f(x) is read “f of x.” The x in the equation f(x) = 2x +3 is a dummy variable, 
which means that it is essentially a placeholder. When we put a specific number in our machine, we replace x 
with that number in the equation f(x) = 2x + 3 to determine what the machine outputs. For example, 


f(6) =2-54+3=13, 


so the machine outputs 13 when we put 5 into it. 


Functions are really that simple. We define the function, and then whenever we input a number to the function, 
we follow the definition to get an output. Usually, we use an equation, such as f(x) = 2x + 3, to define a function. 
For the obvious reason, f is the most commonly used label for functions. 


In this chapter, we will discuss only functions that take real number inputs and give real number outputs. The 
domain of a function consists of all the values we are able to input to the function and get an output, and the 
range of the function consists of all the values that can come out of the function. 


Extra! It isn’t that they can't see the solution. It is that they can’t see the problem. | ‘ 
{1a > 1m ce -G.K Chesterton 
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For example, consider the function 


fl) = 5. 
1 


The value x = 3 is not part of the domain of this function, because =; is not defined when x = 3. We can safely 
put any other value of x into this function, so the domain of f is “all real numbers except 3.” Similarly, there is no 
value of x for which it is possible to make the function output 0. However, we can make the function output any 


other real number, so the range of f is “all real numbers except 0.” 


We call a function that can only output real numbers a real-valued function, or, even more simply, a real 
function. Unless a problem states otherwise, you can assume that all functions in this chapter are real functions. 
Moreover, you can assume that they are only defined for real number inputs. Later in this book we will explore a 
variety of functions that can take nonreal inputs and give nonreal outputs. 


We will sometimes use interval notation to express the domain and range of a function. For example, we can 
denote “all real numbers greater than 3 and less than 5” by the interval (3,5). We use “(” and ”)” to indicate that 
the values 3 and 5 are not included. Notice that we don’t write (5,3); the lesser end of the interval always comes 
first. We write x € (3,5) to indicate that x is a number in this interval, The ”x €” means “x is in,” and (3,5) indicates 
the real numbers greater than 3 and less than 5, as we just described. 


“ k 


To include a boundary value in the interval, we use “[” for the lower bound and ”]” for the upper bound. For 
example, the statement x € (—3, 5] means —3 < x <5 and y € [-12.2, 0] means -12.2 < y < 0. 


Finally, to indicate an interval that has no upper bound or no lower bound (or neither), we use the co symbol. 
For example, we write t > —3 as t € (—3, too). The ”(—3” part indicates that no numbers equal to —3 or lower than 
—3 are in the interval. The ”+00)” part indicates that the interval continues forever in the positive direction; that 
is, there is no upper bound. So, the interval (—3, +00) is all real numbers greater than —3. Similarly, the statement 
w E (—co,—2] is the same as w < —2. Notice that we always use “(” with —oo and “)” with +00, instead of ”[” and 
“J”. In both cases, there is not a boundary value to include in the interval, so we must use ”(” or ”)” instead of ”[” 
oni. 


To write all real numbers in interval notation, we can write (—00, +00). We often use the symbol R to denote 
“all real numbers.” Therefore, we typically write the interval (~o, co) as “IR”, so x € IR means ”x is a real number.” 


Problem 1.1: Let f(x) = 2x* — 3. 
(a) Find f(2). 
(b) Find all values of t such that f(t) = 47. 
(c) Find f(3x + 1). 
Problem 1.2: A function can be defined to accept multiple inputs. For example, let 


g(a, b,c) = 3a — 2b + 7c’. 


(a) Evaluate ¢(3, -5,—1). (b) Find b if 9(b,2,-1) = 21. 


Problem 1.3: Suppose t(x) = ax* + bx? + x + 5, where a and b are constants. In this problem, we find t(4) if 
t(-4) = 3. 
(a) Use t(—4) = 3 to write an equation in terms of 4 and b. 


(b) Express (4) in terms of a and b. , 
(c) Find f(4). 
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Problem 1.4: Find the domain and range of each of the following functions: 


(a) f@)=2x-3 OOE A 
(b) f(x) = V-2x +7 (d) h(x) = 9x2 +4 


Problem 1.5: Suppose p(x) = 2 yx + 3, but that p(x) is only defined for 4 < x < 9. In other words, the domain of 
p is all real numbers from 4 to 9. 


(a) Is there a value of x for which p(x) = 
(b) What is the range of p? 


Problem 1.6: Find the domain of each of the following functions: 


@ f@= 3 


= - 
Preview 7. NO@tice that = a — U) 
XL x-1 


=x. Are f(x) = ose and g(x) = x the same function? 


| Problem 1.1: Let f(x) = 2x? - 

(a) Find f(2). 
(b) Find all values of t such that f (t) = 47. 
(c) Find f(3x +1). 


Solution for Problem 1.1: 
(a) We simply replace x with 2 in the function definition, which gives us f(2) = 2(2% -3=8-3=5. 
(b) If f(t) = 47, we must have 2# — 3 = 47. Adding 3 to both sides gives 2t? = 50, and dividing by 2 gives 
tf? = 25. The two values of t that satisfy this equation are t = 5 and t = —5. Checking our answer, we find 
f(5) = 265" - 3 = 47 and f(-5) = 2(-5)? — 3 = 47. 


Concept: We can often check our answers in algebra problems by plugging the solutions 
O==3 © we find back into the original question. 


(c) Wecaninputan entire expression into a function by replacing the dummy variable in the function definition 
with the expression. Moreover, we can replace the x in the function definition f(x) = 2x? — 3 with an 


expression that contains x. For example, we have 


f(3x +1) = 2(3x +1) -3 = 2x7 + 6x 4+ 1) -3 = 18x + 12x-1. 


SS 
| Problem 1.2: A function can be defined to accept multiple inputs. For example, let 


| g(a, b,c) = 3a — 2b + 7c’. 


(a) Evaluate 9(3, —5, —1). (b) Find b if g(b,2,-1) = 21. 
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Solution for Problem 1.2: 


(a) We replace the dummy variables a, b, and c in the ae: on with the values 3, —5, and —1, in that 
order. This gives us g(3, —5, —1) = 3(3) — 2(-5) + 7(-1)7 = 


(b) What's wrong with this solution: 


‘Bogus Solution: Wehave yain a ee ee he 
“os! g(b,2,-1) = 3(2) — 2b-+ 7(-1)? = ~2b + 13, 


_ so we seek the value of b such that —2b + 13 = 21, which is b=-4 
We see our Meee immediately if we check our answer. We have 
g(—4,2, —1) = 3(-4) — 2(2) + 7(-1)? = -9, 


but we want g(b,2, —1) = 21. Looking at our Bogus Solution above, we see that in finding an expression for 
g(b,2,—1), we set a = 2 and c = —-1, and left b as b. But this isn’t what we must do to evaluate g(b, 2, -1). 


WARNING! n 


Aeee raa He a a o ea verre oh 
assign the values to the dummy ` variables i in the correct order. 


To M aate ee 2, —1), we replace a in the function definition with b, we replace b with 2, and ace c with 
—1. This gives us 
g(b,2,—1) = 3b — 2(2) + 7(-1)? = 3b + 3. 


So, we seek the value of b such that 3b + 3 = 21, which gives us b = 6. Checking our answer, we find that 
g(6,2,—1) = 21, as desired. 


Problem 1.3: Suppose t(x) = axt + bx? + x + 5, where a and b are constants. Find (4) if t(—4) = 


Solution for Problem 1.3: We have 
t(4) = a(4)* + b(4 +4 +5 = 256a + 16b +4 +5 = 256a + 16b + 9. 


If we can find a and b, then we can evaluate t(4). We turn to the only other piece of information we have, which is 
t(—4) = 3. We have 


t(—4) = a(—4)* + b(-4)* + (-4) +5 = 256a + 16b — 4 + 5 = 256a + 16b +1, 


so t(—4) = 3 gives us 256a + 16b + 1 = 3, which means 256a + 16b = 2. Unfortunately, this doesn’t tell us a or b. 
However, looking back at our expression for t(4), we see that we don’t need a and b. We need 256a + 16b, which 
we know equals 2. So, we have #(4) = 256a + 16b +9 = 2 +9 = 11. O 


_ Keep your eye on the ball. Sometimes we don’t need to evaluate every variable 
in a problem in order to solve the problem. 


Concept: — 


Now that we have a little practice evaluating functions, let’s try finding the domain and range of specific 
functions. 


Problem 1.4: Find the domain and range of each of the following functions: 


(a) fl) =2x-3 © sa 5 


(b) f(x) = V-2x4+7 (d) h(x) = 9x2 +4 


1.1. FUNCTION BASICS 


Solution for Problem 1.4: 


(a) 


(b) 


(c) 


We can input any real number to f, so the domain of f is all real numbers. It appears that any real number 
can be output from f. To show that the range of f is all real numbers, we show that for any real number y, 
we can find an x such that f(x) = y. Since f(x) = 2x — 3, we seek the value of x such that 2x — 3 = y. Solving 
for x gives x = (y + 3)/2. So, for any real number y, if we let x = (y + 3)/2, then we have f(x) = y. This tells 
us that every real number is in the range of f. Therefore, both the domain and range are all real numbers. 


Since we can use the symbol R to indicate the real numbers, we can write, “The domain and range of f are 
both R.” 


The square root of a negative number is not a real number, so we must have —2x + 7 > 0. This gives us 


x < 7/2, so the domain is all real numbers less than or equal to 7/2. In interval notation, the domain is 


Intuitively, we might guess that because —2x + 7 can equal any nonnegative number, the range of f is all 
nonnegative real numbers. We can explicitly show that this is the range of f as we did in part (a). We let 


y = f(x) = V-2x +7. 


We then solve this equation for x in terms of y. Squaring both sides of y = ¥—2x + 7 gives y? = —-2x+7. We 
solve for x by subtracting 7 from both sides, then dividing by —2, to find 


So, for any nonnegative value of y, if we let x = (7 — y’)/2, then we have 


pe) = 5 (754) = 2 (G4) +7- = Voy 


This final step, 4/4? = y, is only valid because y is nonnegative. We therefore see that all nonnegative real 
numbers are in the range of f, because for every nonnegative value of y, we can find an x such that f(x) = y. 
Since Y—2x +7 cannot be negative, no negative numbers are in the range of f. So, the range of f is all 
nonnegative real numbers, which we can write as [0, +00). 


We cannot have t = 1 in g(t) = 2t/(t — 1), since this would make the denominator equal to 0. There are no 
other restrictions on the input to g, so the domain is all real numbers except 1. 


Finding the range of g is a little trickier. We let y = g(t), so we have 


a2 
Cah 


Then we solve for t in terms of y. Multiplying both sides by t — 1 gives yt — y = 2t. Solving this equation 
for t in terms of y gives 
Yy 
pae 
YZ 

(Note that t cannot be 1 in this equation, since f = 1 gives us 1 = Ta from which we have y — 2 = y, which 
has no solution for y.) For any desired output y except y = 2, we can use t = -5 to find the input, t, to g(t) 
that will produce the desired output. Therefore, the range is all real numbers except 2. We can denote “all 
real numbers except 2” with interval notation as (—09, 2) U (2, +00). 


The “U” in (—œ, 2) U (2, +œ) means “or,” so, “y E€ (—09, 2) U (2, +00)” means 


y is in the interval (—os, 2) or the interval (2, +00). 
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The union of two intervals consists of all numbers that are in either one or both 
of the intervals. As we just saw, we use the symbol U to denote a union of two 
„intervals, so (—00,2) U (2,+00) means “all numbers in the interval (~, 2) or the 
interval (2, +00).” 
The intersection of two intervals consists of all numbers that are in both of the 
intervals. We use the symbol N to refer to the intersection of two intervals. For. 
example, the expression [3,7] N [5,11] refers to the numbers that are in both the 
interval [3,7] and the interval [5,11]. These numbers form the interval [5,7], so we 
can write 


Sidenote: 


13,710 [5,11] = [5,7]. 


On the other hand, the numbers that are either in [3,7] or in [5,11] (or in both) 
form the interval [3, 11]. Therefore, we have 


| - o 8,714 6,11) = B11). : | 


We also have a special notation for “all real numbers except a few specific values.” We can write “all 
real numbers except 2” as IR\{2}. If we wish to exclude several specific values, we just list them inside the 
curly braces. So, we write “all real numbers except 5, 6, and 7” as IR\{5, 6,7}. 


| 


(d) We can input any real number to the function h(x) = 9x? + 4, so the domain of h is R. Because the square of 
a real number is always nonnegative, the expression 9x? is always nonnegative. Therefore, the expression 
9x? + 4 must be greater than or equal to 4. Since 9x? + 4 can equal any number that is greater than or equal 
to 4, the range of h is [4, +00). 


o 


Note that in parts (b) and (c), we find the domain by considering operations that we are not able to perform. 
Specifically, we cannot take the square root of a negative number in a real function, and we cannot divide by zero. 
This is typically how we find the domain when it is not obvious that the domain is all real numbers, as it is in 
parts (a) and (d). Finding the range is often a matter of considering the possible outputs of special expressions like 
square roots, perfect squares, or absolute values. But, as shown in part (c), the restrictions on the range can also be 
more subtle. There, we found a strategy that is often useful for determining the range of complicated functions: 


Concept: We can often find the range of a function by setting the function definition equal 
to a new variable, and then solving for the dummy variable in terms of the new 
variable. 


1 


For some functions, we need the advanced tools of calculus to determine the range. For still others, the best 
we can do is estimate the range numerically, usually using a computer. When we ask for the range of a function 
in this text, we'll only consider functions whose ranges we can find without these techniques. 


Sometimes functions are defined with explicit constraints on the domain of the function. The range of such a 
function consists only of those outputs that can be obtained from the permitted inputs. Here’s an example: 


Problem 1.5: Suppose p(x) = 2 Vx + 3, but that p(x) is only defined for 4 < x< 9. (In other words, the domain 
of p is all real numbers from 4 to 9.) Find the range of p. 


Solution for Problem 1.5: What’s wrong with this solution: 


Bogus Solution: The output of x can be any nonnegative real number, so the result of 
ale 2 Vx + 3 can be any nonnegative real number greater than or equal to 3. 


pestered litte 
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Our Bogus Solution completely ignores the restriction on the domain of p. 


WARNING! We must take into account any restrictions on the domain of a function when 


finding the range of the function. 


Since p is only defined for inputs x such that 4 < x < 9, the range consists of all values of p that can result from 
inputting a value of x such that 4 < x < 9. Because 4 < x < 9, we have 2 < yx < 3. Multiplying all parts of this 


inequality chain by 2 gives 4 < 2 yx < 6 and adding 3 to all parts of this chain gives 7 < 2 yx +3 < 9. Therefore, 
for all x such that 4 < x < 9, we have7 < p(x) < 9. 


We're not finished yet! We've only shown that all values in the range of p are between 7 and 9, inclusive 
(which means 7 and 9 are included). We haven't shown that all real numbers from 7 to 9 are in the range. This is 
somewhat “obvious,” since p(x) goes “smoothly” from 7 to 9 as x goes from 4 to 9. However, to explicitly prove 
that all possible values from 7 to 9 are in the range, we let y = p(x) = 2 Vx + 3 and solve for x in terms of y. This 


gives us 
= a 
-=| 5 : 


For any value of y from 7 to 9, we can use this equation to find the value of x for which p(x) equals this y. (Make 
sure you see why all the resulting values of x are in the domain of p.) O 


Finding the domain and range of more complicated functions can be pretty tricky. We’ll try finding the domain 
of a couple of such functions. 


Problem 1.6: Find the domain of each of the following functions: 
| @) w= 
(b) g(t) = 


a 
t 


Solution for Problem 1.6: 


(a) The input to a square root must be nonnegative, so we must have 2x—5 2 0. This gives us x = 5/2. However, 
we cannot have x = 3, since this will make the denominator equal to 0. 


WARNING!! Sometimes there is more than one constraint on the domain of a function. 
re 


Combining these two constraints tells us that the domain is all real numbers greater than or equal to 5/2 
except 3. We can write this in interval notation as [5/2, 3) U (3, +0). 


(b) First, we notice that we cannot have t = 0 or 3t — 4 = 0, because we cannot divide by 0. This tells us that we 
must exclude t = 0 and t = 4/3 from the domain. However, we're not finished, because we cannot have the 
entire denominator of g(t) equal to 0. Therefore, we must exclude from our domain those values of t for 


which 1 i 1 : 

t 3f-4 
Multiplying both sides by #(3t — 4) to get rid of the fractions gives us 3f — 4 — t = 0, from which we find 
t = 2. So, the domain of g is all real numbers except 0, 4/3, and 2. We could write this as intervals with the 
a (—œ,0)U (0, $) U G, 2) U (2, +00). We can also write it as R\{0, 4 3, 2}, Which reads “all real numbers 


except 0, 3, and2. 4 
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Te (x —1)(x) © 
-1 x—1 


= N E = = yti roe =% Te same SRE : 


i P : j 2 
Problem 1.7: Notice that - 


Solution for Problem 1.7: No! The function f is not defined for x = 1, even though it appears that we can simplify 
our expression for f(x) to just x. The simplification 


xX-x  x(x-1) _ 
Pol gsl 


is only valid if x -lolix =i) then ==% rae 2, which is not defined. In other words, 1 is not in the domain of f, but 
it is in the domain of g. So, f and g are not the same function. O 


We finish this section with the relatively obvious observation that the sum, product, and quotient of two 
functions are all themselves functions. For example, if f and g are functions, we can define a new function h as 
h = f + g, where this means that h(x) = f(x) + g(x). Note that h(x) is only defined if both f(x) and g(x) are defined. 
Therefore, a number must be in the domains of both f and g in order to be in the domain of f + g. 


In much the same way, we can define a new function as a linear combination of any two functions f and g. 
Specifically, if a and b are constants, then we can define a new function t as t = a: f + b- g, where this means that 


t(x) =a- f(x) + D- g(x). 


We can also define the function p = f - g such that p(x) = f(x) - g(x) and the function q = f/g such that 
q(x) = f(x)/g(x). Note that if q = f/g, then we must exclude values of x for which g(x) = 0 from the domain of g. 


oo 
1.1.1 Let f(x) =x*7-x-6. 
(a) Compute f(2). (d) Is there a real value of x such that f(x) = 6? 
(b) Find all x such that f(x) = 0. (e) Find the range of f. Hints: 1 


(c) Find f(x- 1). 
1.1.2 Find the domain and range of each of the following real-valued functions: 


(a) f(x) =[1—-4 (b) f(t)=v2-¢ (c) AQ) =1-x (dq) gt) = a7 


1.1.3 Let f(x) = V4- x and g(x) = y2x — 6. Find the domain of f - g and the domain of f/g. Explain why these 
domains are different. 


[2x —5 V2x — 
1.1.4 Do the functions f(x) = = k and g(x) = = have the same domain? Why or why not? 
Be rm 


1.1.5 Let f(x) = ma Suppose |x| + 1. Simplify f(x) - f(~x). 
1.1.6 A function g(x) is defined for —3 < x < 4 such that g(x) = (2+ x)’. Find the range of g. 
1.1.7 Let T(a,b,c) = 3a" —c. 


(a) Find T(2,3,-5). 
(b) Find all values of x such that T(x, 2,6) = 


1.2. GRAPHING FUNCTIONS 


1.2 Graphing Functions 


When we graph an equation in which the only variables are x and/or y, we plot all points (x, y) on the Cartesian 
plane that satisfy the equation. When we graph a function f, we graph the equation y = f(x). For example, the 
graph of the function f(x) = x? is shown at right. 


The x-intercepts of a graph are the points where the graph crosses the x-axis, while the y-intercepts are the 
points where the graph crosses the y-axis. For example, the point (0, 0) is both an x-intercept and a y-intercept of 
the graph of f(x) = x°. (Note: Some sources use “x-intercept” to refer to the x-coordinate of a point where a graph 
crosses the x-axis, and likewise for “y-intercept.” So, these sources would describe both the x-intercept and the 
y-intercept of the graph of f(x) = x? as 0 instead of (0, 0).) 


In this section, we review the basics of graphing functions, getting information from the graph of a function, 
and transforming the graph of a function. 


Problems pe- 


Problem 1.8: For each of the functions below, graph the function and find the x-intercepts and the y-intercepts 
of the graph. 


(a) f@)=(x-6-4 () f)=ver5-3 
(b) f(x) =4 (a) f(@) =|r+2/41 


Problem 1.9: The graph of y = f(x) is pictured at right. 
(a) According to the graph, what is f(2)? 
(b) According to the graph, for what values of a is f(a) = 2? 


Problem 1.10: 
(a) Is it possible for a vertical line to intersect the graph of a function at two different points? Why or why 


not? 
If there is no vertical line that passes through more than one point of a curve on the Cartesian plane, then 


is the curve the graph of a function? 
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Problem 1.11: At right is the graph of y = f(x). 


(a) Explain why the-graph of y = f(x) +2 is a 2-unit upward shift of the 
graph of y = f(x). 

(b) Explain why the graph of y = f(x + 2) is a 2-unit leftward shift of the 
graph of y = f(x). 

(c) Find the domain and range of h if h(x) = f(x +3) —4. 


Problem 1.12: Let f(x) be the function graphed in the previous problem. 
(a) Find the graph of y = — f(z). 
(b) Find the graph of y = f(—x). 
(c) Find the graph of y = 2f(x). 
(d) Find the graph of y = f(2x). 


| Problem 1.8: For each of the functions below, graph the function and find the x-intercepts and the y-intercepts 
of the graph. 


(a) f(x)=(x-6)P -4 (c) f(x)=vx+5—-3 
(b) f(x)=4 (d) f(x) =|x+2)+1 


Solution for Problem 1.8: 


(a) The graph of y = (x — 6)? — 4 is an upward-opening parabola with vertex (6, —4). We find several points on 
the graph by finding y for various values of x. A table with these points is at left below, and the graph is at 
right below. 


x 
3 
4 
5 
6 
7 
8 
9 


BEA 

The y-intercept of the graph is the point where the graph intersects the y-axis. All points on the y-axis 

have an x-coordinate of 0, so evaluating f(0) gives us the y-coordinate of the y-intercept of the graph of 
y = f(x). Since f(0) = (0 — 6)? — 4 = 32, the y-intercept of the graph is (0, 32). 

The x-intercepts of the graph are those points where the graph intersects the x-axis. The y-coordinate of 
such a point is 0, so the x-coordinates of the x-intercepts are the solutions to the equation f(x) = 0. Therefore, 
we seek the values of x such that (x — 6)? — 4 = 0. Expanding and rearranging gives us x? — 12x + 32 = 0. 
Factoring gives (x — 4)(x — 8) = 0, so our solutions are x = 4 and x = 8. Therefore, the x-intercepts of our 
graph are (4,0) and (8,0). (Notice that we could also read these off our graph.) 
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(b) Since f(x) = 4, the graph of y = f(x) is the graph of y = 4, which is a horizontal y 
line as shown at right. This line meets the y-axis at (0, 4), so this is the y-intercept. io 
The graph never hits the x-axis, so it does not have an x-intercept. 


Seen + 4 ie. 
i teat o 


(c) As with the first part, we choose values of x, and then find y such that y = f(x). Nate: that we start with 
x = —5 and then substitute greater values of x, because the domain of f is [—5, +00). 


Py | TETEE 


When x = 0, we have y = f(0) = -3 + V5, so the y-intercept is (0,-3 + V5). When y = 0, we have 
0 = yx+5-3. Adding 3 to both sides gives 3 = yx +5, and squaring both sides of this gives 9 = x + 5. 
Solving this gives x = 4, so (4, 0) is the x-intercept of the graph. 

(d) Ifx > —2, we have |x + 2| = x + 2, so if x > —2, we have f(x) =x+2+1=x+3. 
So, when graphing y = f(x), we graph y = x + 3 for x > —2. Similarly, if x < —2, 
we have |x + 2| = —x —2, so graphing y = f(x) for x < —2 is the same as graphing 
y = -x-2 +1 = -x - 1. The result is shown at right. 

When x = 0, we have y = f(0) = 3, so (0,3) is the y-intercept of the graph. 
When y = 0, we have 0 = |x + 2| + 1. Subtracting 1 gives |x + 2| = —1, which has 
no solutions because absolute value is always nonnegative. So, the graph has 
no x-intercept. (We can also see this from our graph.) 


o 


Now that we’ve used the definition of a function to get information about the graph of the function, let’s try 
going the other direction—using a graph to get information about the function. 


Problem 1.9: The graph of y = f(x) is pictured at right. 
(a) According to the graph, what is f(2)? 
(b) According to the graph, for what values of a is f(a) = 2? 


Solution for Problem 1.9: 
(a) The graph of y = f(x) passes through (2,3), so we have f(2) = 3. 
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‘Important: If the point (a,b) is on the graph of y = f(x), then f(a) = b. Conversely, if 


T f(a) = b, then (a, b) is on the graph of y = f(x). 


(b) To determine the TE F a such that f(a) = 2, we locate the points on the graph of y = f(x) such that y = 2. 
The x-coordinates of these points then are our desired values of a. Since (0,2) is on the graph of y = f(x), 
we have f(0) = 


There is another point on the graph with y-coordinate equal to 2, but it’s not immediately obvious what 
the x-coordinate of this point is. However, this point is on the line segment connecting (2, 3) to (3,0), so we 
can use the equation of the line through these two points to find the desired x-coordinate. The slope of this 
line is (3 — 0)/(2 — 3) = —3, so a point-slope equation of the line is y — 0 = —3(x — 3), or y = —3x + 9. When 
y = 2 in this equation, we have x = 7/3, so the line passes through (7/3, 2). Therefore, we have f(7/3) = 2, 
so the two values of a for which f(a) = 2 area = 0 anda = 7/3. 


Problem 1.10: 
(a) Isit possible for a vertical line to intersect the graph at a function at a different points? Why or why f 
not? 


(b) If there is no vertical line that passes through more than one point of a curve on the Cartesian plane, 
then is the curve the graph of a function? 


Solution for Problem 1.10: 


(a) For each value of x in the domain of f, there is exactly one value of f(x). So, for each such value of x, there 
is only one point (x, y) on the graph of y = f(x). Therefore, it is impossible for the graph of a function to 
pass through two points with the same x-coordinate. Because the equation of any vertical line has the form 
x = a for some constant 4, it is impossible for a vertical line to pass through more than one point on the 
graph of a function. 


(b) If there is not a vertical line that passes through more than one point of a graph, then for every value of x, 
there is at most one value of y such that (x, y) is on the graph. In other words, for every x for which there is 
a point (x, y) on the graph, there is exactly one corresponding value of y, so the graph does indeed represent 
a function. 


‘Important: A curve on the Cartesian plane is the graph of some function if and only if every 
vertical line passes through no more than one point on the curve. We call this 
the vertical line test. 


Problem 1.11: At right is the graph of y = f(x). 
Explain why the graph of y = f(x) + 2 is a 2-unit upward shift of the 
graph of y = f(x). 
Explain why the graph of y = f(x + 2) is a 2-unit leftward shift of the 


graph of y = f(x). 
Find the domain and range of h if h(x) = f(x + 3) — 4. 


1.2. GRAPHING FUNCTIONS 


a er hhh iE esses 


Solution for Problem 1.11: 


(a) 


(b) 


Suppose x = a, where a is in the domain of f. Then, the point (a, f (a)) is on 
the graph of y = f(x) and the point (a, f(a) + 2) is on the graph of y = f(x) +2. 
Therefore, every point of the graph of y = f(x) + 2 is 2 units above the 
corresponding point on the graph of y = f(x). So, the graph of y = f(x) +2 
is a 2-unit upward shift of the graph of y = f(x). The graph of y = f(x) +2 is 
solid in the graph at right, and the graph of y = f(x) is dashed. 

Similarly, if k is positive, then the graph of 


Y= jotk 
is a k-unit upward shift of the graph of y = f(x), and the graph of 


y = f(x) —k 
is a k-unit downward shift of the graph of y = f(x). 


Suppose x = a, where a is in the domain of f. Then, the point (a, f(a)) is on 
the graph of y = f(x) and the point (a, f(a + 2)) is on the graph of y = f(x+2). 
Unfortunately, it isn’t so clear how f(a) and f(a+2) are related. To get a point 
on the graph of y = f(x + 2) with y-coordinate equal to f(a), we have to let 
x=a-2. When x =4a-2in y= f(x +2), we have y = f(a), so (a — 2, f(a) is 
on the graph of y = f(x +2). 

We've shown that for any a in the domain of f, the point (a, f (a)) is on the 
graph of y = f(x) and the point (a — 2, f(a)) is on the graph of y = f(x + 2). 
Therefore, every point on the graph of y = f(x + 2) is 2 units to the left of the 
corresponding point on the graph of y = f(x). So, the graph of y = f(x + 2) 
is a 2-unit leftward shift of the graph of y = f(x). The graph of y = f(x +2) 
is solid in the graph at right above, and the graph of y = f(x) is dashed. 


Similarly, if k > 0, then the graph of 


y = fx+hk) 

is a k-unit leftward shift of the graph of y = f(x) and the graph of 
areal Camel 

is a k-unit rightward shift of the graph of y = f(x). 


A common mistake is to think that the graph of y= f(x+k isa rightward. 
shift of the graph of y = f(x) when k is positive. This is incorrect; make sure 
you see why. 


-WARNING!! 
hà 


The graph of y = f(x +3) —4 is the result of shifting the graph of y = f(x) downward 4 units and leftward 3 
units. The leftward shift means the domain of h is 3 units “to the left” of the domain of f. The domain of 
f is [-5,3], so the domain of h is [-8,0]. The downward shift means the range of h is 4 units “below” the 
range of f. The range of f is [-3,4], so the range of h is [-7, 0]. 


In Problem 1.11, we learned the effect of adding a constant to the input or to the output of a function: 


When k > 0, the graph of y = f(x) +k results from shifting the graph of y = f(x) 
vertically upward by k units, while the graph of y = f(x +k) results from shifting 
the graph of y = f(x) horizontally to the left by k units. Similarly, the graph of | 
y = f(x) —kis a k-unit downward shift of the graph of y = f(x) and the graph 
of y = f(x — k) is a k-unit rightward shift. 


Important: 
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Problem 1.12: At right is the graph of y = f(x). 
(a) Find the graph of y = -f (x). 
(b) Find the graph of y = f(-x). 
(c) Find the graph of y = 2f(x). 
(d) Find the graph of y = f(2x). 


Solution for Problem 1.12: 


(a) As before, we let x = a, where a is in the domain of f. Then, the point (a, f(a)) is on the graph of y = f(x) and 
the point (a, — f(a)) is on the graph of y = -f (x). So, the y-coordinate of each point on the graph of y = — f(x) 
is the opposite of the y-coordinate of the corresponding point on the graph of y = f(x). Therefore, we form 
the graph of y = -f (x) by reflecting the graph of y = f(x) over the x-axis. Below, the graph of y = —f(x) is 
solid and the graph of y = f(x) is dashed. 


(b) We let x = a, where a is in the domain of f. Then, the point (a, f(a)) is on the graph of y = f(x) and the 
point (a, f(—a)) is on the graph of y = f(-x). We can find a point on the graph of y = f(—x) with f(a) as 
the y-coordinate by letting x = —a, which tells us that (—a, f(a)) is on the graph of y = f(—x). Therefore, 
the graph of y = f(—x) is the result of reflecting the graph of y = f(x) over the y-axis. Below, the graph of 
y = f(-x) is solid and the graph of y = f(x) is dashed. 


‘Important: — The graph of y = -f (x) is the result of reflecting the graph of y = f(x) over the 
X-axis. | 
The graph of y = f(—x) is the result of reflecting the graph of y = f(x) over the 
y-axis. | 


1.2. GRAPHING FUNCTIONS 


(c) For each point (a, f(a)) on the graph of y = f(x), the point (a,2f(a)) is on 
the graph of y = 2f(x). The latter point is twice as far from the x-axis as 
the former. Similarly, each point on the graph of y = 2 f(x) can be found 
by doubling the distance from the x-axis of the corresponding point on the 
graph of y = f(x). Therefore, the graph of y = 2f(x) can be found by scaling 
the graph of y = f(x) vertically away from the x-axis by a factor of 2. At 
right, the graph of y = 2f(x) is solid and the graph of y = f(x) is dashed. 


(d) For each point (2a, f(2a)) on the graph of y = f(x), the point (a, f(2a)) is on 
the graph of y = f(2x). The latter point is half as far from the y-axis as the 
former. Similarly, each point on the graph of y = f(2x) can be found by 
halving the distance from the y-axis of the corresponding point on the graph 
of y = f(x). Therefore, the graph of y = f(2x) can be found by scaling the 
graph of y = f(x) horizontally relative to the y-axis by a factor of 1/2. At 
right, the graph of y = f(2x) is solid and the graph of y = f(x) is dashed. 


eens eeacad 


RAET 


i H 


L RS Is 


When k > 0, the graph of y = kf(x) results from scaling the graph of y = f(x) 
vertically relative to the x-axis by a factor of k, while the graph of y = f(kx). 
results from scaling the graph of y = f(x) horizontally relative to the y-axis by a 
factor of 1/k. 


WARNING!! The graph of y = f(kx) is a horizontal scaling of the graph of y = f(x) by a 
ye factor of 1/k, not by k. So, for example, the graph of y = f(3x) is the result of 


compressing the graph of y = f(x) towards the y-axis, not stretching it away 
from the y-axis. 


[ytxercises Ba 


1.2.1 Graph each of the following, and find the x-intercepts and y-intercepts of each graph. 


(a) 


f(x) =3x-7 (b) f(x) =2- ixl (c) f(x)=x-5x+6 


1.2.2 Determine if each of the following graphs represents a function: 
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1.2.3 Suppose f(9) = 2. For each of the parts below, find a point that must be on the graph of the given equation. 


(a) y=f(x—3)+5 
(b) y=2f(x/4) 
© yaf (er Fa 7 


1.2.4x Suppose f(4) = 8. Note that the point (2,8) is on the graph of y = f(2x) and that the point (5,8) is on 
the graph of y = f(2x — 6). So, the point we have found on the graph of y = f(2x — 6) is 3 units to the right of 
the point we found on the graph of y = f(2x). Which of the following is true: (a) we made a mistake; (b) it’s a 
coincidence—not every point on y = f(2x — 6) is 3 to the right of a point on y = f(2x); (c) the graph of y = f(2x — 6) 
is a 3-unit rightward shift of the graph of y = f(2x). Explain your answer. Hints: 53 


1.3 Composition 


Sometimes we want to hook two (or more) machines together, taking the output from one machine and putting 
it into another. When we connect functions together like this, we are performing a composition of the functions. 
For example, the expression f(g(x)) means we put our input x into function g, and then take the output, g(x), and 
put that into function f. A composition of functions is also sometimes indicated with the symbol o. For example, 
when we write h = f o g, we define the function h such that h(x) = f(g(x)). 


| Problems ag 


Problem 1.13: Suppose f(x) = 3x +7 and g(x) = 2 Vx -3. 
(a) Find f(g(3)) and g(f(3)). 

(b) Find aif f(g(a)) = 25. 

(c) Is ge(f(-7)) defined? Why or why not? 

(d) Suppose that h = go f. What is the domain of h? 


Problem 1.14: ee cae — 

(a) Let h(x) = f(g(x)), where f and g are functions, and suppose h(x) is defined for all values of x in the 
domain of g. Explain why the range of g must be part of the domain of f. 

(b) Again, let h(x) = f(g(x)), where f and g are functions. Suppose every value in the range of g is in the 

domain of f. Must the domain of h be the same as the domain of g? 


Problem 1.15: Let f(a) = a — 2 and F(a, b) = b +a. Find eS f(4)). (Source: AHSME) 


Problem 1.16: | 
(a) Let f(x) = 3x—1. Find f(f(x)) and FAF). | 
(b) Suppose g(x) = ax + b, where a and b are real constants. Find all possible pairs (a,b) such that we have 

g(g(x)) = 9x + 28. 


Problem 1.17: Let f(x) = yx and g) = x°. Is it true that f(g(x)) and g(f(x)) are the same function? Why or why 


not? 
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Problem 1.13: Suppose f(x) = 3x +7 and g(x) = 2 Vx — 3. 
(a) Find f(g(3)) and g(f(3)). 
(b) Find aif f(g(a)) = 25. 

(c) Is g(f(-7)) defined? Why or why not? 

(d) 


Suppose that h = go f. What is the domain of h? 


Solution for Problem 1.13: 


(a) We have 9(3) = 2 V3 -3 = 0, so f(9(3)) = f(0) = 3(0) + 7 = 7. Similarly, we have f(3) = 3(3) + 7 = 16, so 
g(f(3)) = g(16) = 2 V16 -3 = 2 V13. Notice that f(9(3)) + ¢(f(3)). 


WARNING}! If f(x) and g(x) are functions, then the functions f(9(x)) and g(f(x)) are not 
a 


necessarily the same function. 


(b) Wehave 
f(g@)) = f(2 Va — 3) = 3(2 Va - 3) +7 = 6 Va -347, 


so f(g(a)) = 25 means 
6Va-3+7 = 25. 


Subtracting 7 from both sides and then dividing both sides by 6 gives Va — 3 = 3. Squaring both sides gives 
usa — 3 = 9, so a = 12. Checking our answer, we find f(g(12)) = f(6) = 25, as expected. 

(c) We have f(-7) = 3(-7) +7 = —-14, so 9(f(—7)) = g(-14). However, the value —14 is not in the domain of g. 
Since g(—14) is not defined, g(f(—7)) is not defined. 

(d) Since h(x) = g(f(x)), in order for x to be in the domain of h, both f(x) and g(f(x)) must be defined. The 
domain of f is all real numbers; however, as we saw in the previous part, g(f(x)) is not defined for all real 
values of x. 

In order for g(f(x)) to be defined, the value of f(x) must be in the domain of g. The domain of g is 
[3,+00). Therefore, in order for g(f(x)) to be defined, we must have f(x) = 3. So, we must have 3x + 7 > 3, 
which gives us x > —4/3. For all values of x such that x > —4/3, the expression g(f(x)) is defined. For any 
value of x less than —4/3, the expression g(f(x)) is not defined. Therefore, the domain of h is [—4/3, +00). 


o 


| Problem 1.14: 

(a) Let h(x) = f(g(x)), where f and g are functions, and suppose h(x) is defined for all values of x in the 
domain of g. Explain why the range of g must be part of the domain of f. 

(b) Again, let h(x) = f(g(x)), where f and g are functions. Suppose every value in the range of g is in the 

domain of f. Must the domain of h be the same as the domain of g? 


Solution for Problem 1.14: 


(a) Since h(x) = f(g(x)) and h(x) is defined for all values of x in the domain of g, the expression f(g(x)) must be 
defined for all values of x in the domain of g. Because f(ẹ(x)) is defined, the value of g(x) is in the domain 
of f. So, every number in the range of g must be in the domain of f. 


(b) The expression f(g(x)) is not defined if g(x) is not defined. Therefore, all values in the domain of h must be 
in the domain of g. Conversely, if x is in the domain of g, then g(x) is defined. Furthermore, we are given 
that every value in the range of g is in the domain of f, so f(g(x)) is defined for all x in the domain of g. 
Therefore, every value in the domain of g is in the domain of h. Since every value in the domain of g is in 
the domain of h, and vice versa, we conclude that the domain of h is the same as the domain of g. O 
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Problem 1.15: If f(a) = a — 2 and F(a, b) = b? + a, find the value of F(3, f(4)). (Source: AHSME) ~ 


Solution for Problem 1.15: We note that f(4) = 4-2 = 2, so 


EG, f(4)) = FG 2) = 2? +3 =7. 


Although the expression F(3, f(4)) looks a little intimidating, the process of evaluating it is just simple plug- 
and-chug arithmetic. 


Concept: 


We'll see the value of notation as we move to more complicated problems throughout this book. However, 
we'll usually introduce notation with simple problems like Problem 1.15 so you can get used to the notation before 
using it on harder problems. 


Don’tbe intimidated by notation. Often the notation is a fancy way of representing 
ry simpler cec ? 


Problem 1.16: 
(a) Let f(x) = 3x- 1. Find f(f(x)) and f (x). 


(b) Suppose g(x) = ax + b, where a and b are real constants. Find all possible pairs (a, b) such that we have 
g(e(x)) = 9x + 28. 


Solution for Problem 1.16: 


(a) We have 
Ff@) = fx- 1) = 368x- 1) -1 = 9x- 4. 


We can now use this result to find f(f(f(x))) by inputting f(f(x)), which equals 9x — 4, into f(x): 
SfE) = f(9x - 4) = 4) — 1 = 27x — 13. 
(b) In order to find a and b, we first need an expression for ẹ(g(x))}. We have 
g(e(x)) = g(ax + b) = a(ax + b) +b = a’x +ab +b. 
So, our equation g(g(x)) = 9x + 28 is now a?°x + ab + b = 9x + 28. In order for this equation to be true for all 


values of x, the coefficient of x on both sides must be the same and the constant terms on both sides must 
be the same. Therefore, we have the system of equations 


a’ =9, 
ab + b = 28. 
From a* = 9, we have a = +3. Letting a = 3 in the second equation gives b = 7 and letting a = —3 in the 


second equation gives b = —14. So, our two possible pairs are (a, b) = (3,7) and (a, b) = (-3, -14). 
0 


We have a special notation for repeatedly feeding the output of a function back into the function itself. In this 
text, we will often write f° (x) to mean f(f(x)). Likewise, f” (x) usually refers to applying the function f(x) exactly 
n times; for example, f°(x) = f(f(f(f(f(«))))). 
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Some sources use f(x) to mean f(x): f(x) instead of f(f(x)). Likewise, f"(x) is 
sometimes used to denote [ f(x)]". You'll often have to determine by context 
whether f"(x) refers to a function applied n times, or to a function raised to’ 
the n™ power. : 


WARNING! 
ae 


Problem 1.17: Let f(x) = Vx and g(x) = x’. Is it true that f(g(x)) and 9(f(x)) are the same function? Why or 
why not? 


Solution for Problem 1.17: At first glance, we might be tempted to reason as follows: 


Bogus Solution: We have f(g(x)) = RE = TEET gF) = (Vx? = oe F Fe) RETI eF) 


are the same. 


This Bogus Solution contains two significant errors. First, the domain of f is all nonnegative numbers, so the 
domain of g(f(x)) is all nonnegative numbers. However, because the domain of g is all reals while the range of g is 
all nonnegative numbers, the expression f(g(x)) is defined for all real x. Therefore, the domain of f(g(x)) is all real 
numbers. So, the two functions g(f(x)) and f(¢(x)) have different domains, which means that the functions are 
not the same. For example, we have f(g(—1)) = f(1) = 1, but g(f(—1)) is undefined, because f(—1) is undefined. 


The other error occurs when we write Vx? = x. This statement is only true if x > 0. If x < 0, then we have 
Væ = —x. So, it is not true that f(g(x)) = x for all x. If x > 0, then f(g(x)) = x, and if x < 0, then f(g(x)) = —x. We 
can capture both of these by writing f(9(x)) = |x|. 5 


Exercises > 


1.3.1 Check the answer to part (b) of Problem 1.16 by evaluating 9(9(x)) for g(x) = 3x +7 and for g(x) = —3x — 14. 
Me (ja — 2¢ and g(x) = Vl =x. 


(a) Find f(g(x)). 
(b) Find the domain of f(g(x)). 
(c) Is g(f(—2)) defined? Why or why not? 


1.3.3 Are both f(f?(x)) and f*(f(x)) the same as f?(x)? 


1.3.4 Leta be a constant, and let f(x) = ax. Find an expression for f"(x) in terms of a and n, where n is a positive 
integer. 


1.3.5 Suppose h, f, and g are functions such that h(x) = f(9(x)). If (3,0) is the only x-intercept of the graph of g, 
then can we determine the x-intercepts or y-intercepts of h? 


1.4 Inverse Functions 


You're probably very familiar with using Control-Z when typing on a computer: that’s the “undo” command 
for many applications. Being able to “undo” the work of a function can be extremely useful. The machine that 


“undoes” the work of a function f is called the inverse function of f. 
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Definition: Function g is the inverse of function f if 


g(f(x)) = x for all values of x in the domain of f, and 
f(g(x)) = x for all values of x in the domain of g. 


From our definition, we see that if g is the inverse function of f, then f is the inverse function of g. Moreover, 
the range of g is the domain of f, and vice versa. 


As we will see, not every function has an inverse. However, if a function has an inverse, then the inverse is 
unique, as our definition implies. We can see this by noting that if ¢ and h are both inverses of f, then we have 
g(f(x)) = x and h(f(x)) = x, so g(f(x)) = h(f(x) for all x in the domain of f. Letting y = f(x), we have g(y) = h(y) 
for all y in the domain of g and h, so g and h are the same function. 


We have a special notation to denote the inverse of a function. We often write the inverse of f as f~!. 


WARNING!! When working with functions, fo does not mean rs The expression f~ is a 
a special notation that denotes the inverse of the function f. 


Problems 


ee naana e e aea ee: 


Problem 1.18: In this problem we find the inverse of the function f(x) = 3x -5. 


(a) Let gbe the inverse of f, so that we have f(g(x)) = x and ¢(f(x)) = x. Which of these equates is easier 
to use to find g(x)? 


(b) Use the definition of f to solve the equation f(y) = x for y. 
(c) What is g(x)? 
Check your answer by confirming that g(f(x)) = x. 


Problem 1.19: Let f(x) = z = 


(a) Find f-1(7), without finding f(x). 
(b) Find f-?(x). 


Problem 1.20: 
(a) Does the function g(x) = 2x* — 9 have an inverse? If it does, find it. If it doesn’t, explain why it doesn’t. 


(b) Does the function h(x) = 3 ¥2x + 1 have an inverse? If it does, find it. If it doesn’t, explain why it doesn’t. 
(Be careful on this part!) 


Problem 1.21: 
- (a) Explain how to determine from the graph of a function whether or not the function has an inverse. 
(b) How are the graph of a function and the graph of its inverse related? 


Problem 1.22: Suppose functions f and g both have inverses. Show that if h = f o g, then the inverse of h is 


ee o j 


Problem 1.18: Find the inverse of the function f(x) = 3x - 5. 
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Solution for Problem 1.18: Suppose g is the inverse of f. Then, we must have 


(fx) = flex) = x. 


The equation f(e(x)) = x is more helpful, because ME know f(x). So, we can put g(x) into f(x), which gives us 
f(g(x)) = 3g(x) - 5. Therefore, we must have 3g(x) - 5 = x. 


Important: Just as we can sometimes solve an equation for a variable by isolating the 


variable, we can sometimes solve an equation for a function by isolating that 
function. 


We isolate g(x) by adding 5 to both sides, then dividing by 3. This gives us g(x) = “2. We can check our work by 
confirming that f(g(x)) = g(f(x)) = x. Therefore, we have f~1(x) = 20 


Note that we found the inverse of f by solving the equation f(g(x)) = x for g(x). Make sure you see why this 
produces a function g that is the inverse of f. 


Important: If f has an inverse g, then we can often find that inverse by solving the equation 


Vv f(g(x)) = x for 9(x). 


If you find the idea of SPR Ris for a function” confusing, you can instead let y = g(x) and solve for y. For 
example, when f(x) = 3x — 5, we can solve for y in f(y) = x to find the inverse of f. Since f(y) = x, we have 
3y — 5 = x. Solving for y gives y = #2, so the inverse of f is f'(x) = #. 


Problem 1.19: Let f(x) = ae 


(a) Find f~1(7), without finding f +(x). 
(b) Find f~*(x). 


Solution for Problem 1.19: 


(a) Suppose y = f-1(7). Because f~! is the inverse of f, we have f(f~'(x)) = x for all values of x. Specifically, 
we have f(f~!(7)) = 7, so because y = f—'(7), we have f(y) = 7. We also could have deduced f(y) = 7 by 
noting that y = f~'(7) means that the y is the input to f that produces 7 as an output. 


Important: If function f has an inverse, then f(a)=b means that f -1(b) = a, and vice versa. 


Since Fy) = 7, we have 


=F y 
= 
oe 
Multiplying both sides by 3 — 2y gives —5 — y = 7(3 — 2y). Solving this equation gives us y = 2. Therefore, 
fies 
(b) As we just learned, we can find the inverse of f by solving the equation f(y) = x for y in terms of x. Using 
our definition of f, the equation f(y) = x becomes 


Multiplying both sides by 3 — 2y gives —5 — y = x(3 — 2y). Isolating the terms with y on the left side gives 
2xy — y = 3x + 5, so y(2x — 1) = 3x + 5. Dividing by 2x — 1 gives 


3x +5 
=i] EE 
DO A 
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With this, we can check our answer to the first part by directly evaluating f~'(7) = 2. 
m 


We say that a function is invertible if it has an inverse. As you might have guessed, not all functions are 
invertible. 


Problem 1.20: 
(a) Does the function g(x) = 2x? — 9 have an inverse? If it does, find it. If it doesn’t, explain why it doesn’t. 


(b) Does the function h(x) = 3 ¥2x + 1 have an inverse? If it does, find it. Ifit doesn’t, explain why it doesn’t. 


Solution for Problem 1.20: 


(a) Both g(1) and g(—1) equal —7. If g has an inverse, should this inverse return 1 or —1 when —7 is input? We 
can’t tell! Therefore, g is not reversible; it does not have an inverse. 


Important: If a function gives the same output for two different inputs, then the function 


Vv does not have an inverse. 


On the other hand, if a function never gives the same output for two different inputs, then the function is 
invertible—we can tell from the output of the function what input produced that output. 


Important: If there are no two different inputs to a function that produce the same output, 


Vv then the function has an inverse. 


(b) We start by looking for two different inputs to h that give the same output. After inputting a few values to 
h, we notice that as we input higher and higher values of x, we get higher and higher values of h(x). This 
makes us think that might indeed have an inverse, since it seems impossible to find two different inputs 
that give the same output. So, we try to find the inverse of h by solving the equation h(y) = x for y. This 


gives us the equation 
34/2y+1=x. 


Squaring both sides gives us 9(2y + 1) = x*. Solving for y gives 
© 1 
4-8 2° 
This isn’t the end of the story, however. Recall our definition of inverse functions. Functions f and h are 
inverse functions of each other if 


h(f(x)) = x for all values of x in the domain of f, and 
f(A(x)) = x for all values of x in the domain of h. 


We let f(x) = 5 — 4. Our work above suggests that this is the inverse of h. However, if f is the inverse of 
h, then h is the inverse of f. So, we should have h(f(x)) = x for all x in the domain of f. Unfortunately, we 
have f(—3) = 0, but h(f(—3)) = h(0) = 3, so we don’t have h(f(x)) = x when x = —3. The problem here is that 
the function f(x) = 5 — } does not have an inverse. We can see this quickly by noting that f(3) = f(-3) = 0. 


So, where did we go wrong? 


Let’s evaluate h(f(x)) and see if it really does equal x. Maybe this will shed some light on where we 


went wrong. We have 
KEVA = -1+1=3/% - Ve. 


Here, we have to be careful. 
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1.4. INVERSE FUNCTIONS 


WARNING! We only have Væ = x if x is nonnegative. | 
a 


Now, we've found our problem. We only have h(f(x)) = x if Vx = x. This only occurs if x is nonnegative. 
So, if we restrict the domain of f to nonnegative numbers, then we have h(f(x)) = x for all values of x in the 


domain of f. We can quickly confirm that f(h(x)) = x for all x in the domain of h as well. So, the inverse of 


his ht(x) = 5 — 4, where the function h™! is only defined for nonnegative values of x. 


Problem 1.21: 
(a) Explain how to determine from the graph of a function whether or not the function has an inverse. 


(b) How are the graph of a function and the graph of its inverse related? 


Solution for Problem 1.21: 


(a) 


(b) 


A function has an inverse if and only if we can always tell from the output of the function what the input 
was. If f has an inverse, then when we graph y = f(x), there can be no two points on the graph with the 
same y-coordinate. If there were, then we would have two different inputs (the x-coordinates of points on 
the graph) that provide the same output (the common y-coordinate) for the function, so it couldn’t have an 
inverse. “No two points on the graph have the same y-coordinate” means the same thing as “No horizontal 
line intersects the graph in more than one point.” Therefore, we have a horizontal line test for whether or 
not a function has an inverse: 


Important: A function has an inverse if and only if there does not exist a horizontal line 
Vv that passes through more than one point on the graph of the function. 


Suppose the point (a, b) is on the graph of y = f(x). Then, we have f(a) = b, so f-*(b) = a. Because f~! (b) = a, 
the point (b, a) is on the graph of y = f'(x). Therefore, if (a, b) is on the graph of a function, then (b,a) is on 
the graph of its inverse. 


The fact that the points on the graph of y = f~1(x) can be found by 
reversing the coordinates of the points on the graph of y = f(x) has a 
geometric interpretation. We just learned that if (a,b) is on the graph 
of y = f(x), then (b,a) is on the graph of y = f'(x). In the graph at 
right, we have plotted the points (5, 6), (6, -3), and (—7, 1). We have also 
plotted the points that result when the coordinates of these three points 
are reversed. We see that the new points are the result of reflecting the 
old points over the line y = x. The graph of y = x is the dashed line in 
the diagram at right. 


In general, reversing the coordinates of a point has the same effect as 
reflecting the point over the graph of y = x. To prove this, we let point 
P be (a,b) and Q be (b,a). The slope of PQ is —1, so PQ is perpendicular 
to the graph of y = x. Furthermore, the midpoint of PQ is (42, atb), which is on the graph of y = x. So, 
the graph of y = x is the perpendicular bisector of PQ, which means that P and Q are images of each other 
upon reflection over the graph of y = x. So, each point on the graph of y = f -1(x) is the mirror image of 
a point on the graph of y = f(x) when reflected over the line y = x (and vice versa). Therefore, the entire 


graph of y = f~! (x) is the mirror image of the graph of y = f(x) when reflected over the line x = y. O 


` If the function f has an inverse f~, then the graph of y = f~1(x) is the reflection 
of the graph of y = f(x) over the line x = y. 


Important: 


Y 
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Problem 1.22: Suppose functions f and g both have inverses. Show that if h = f o g, then the inverse of h is 
-1 o f-1 
w 


Solution for Problem 1.22: Let t = g" o f-t. To show that t is the inverse of h, we must show that h(t(x)) = x and 
t(h(x)) = x. We'll start with h(t(x)). Using our definitions of h and t, we have 


h(t(x)) = FET UTT O). 
That looks pretty scary! But seeing both g and g~! right next to each other reminds us that g(g~1(x)) = x. In other 
words, the output of g o 7! is always the same as the input. So, when we put f~! (x) into g o g!, we get f'(x) 
out, which means 

E ODES 
So, using this in our expression for h((x)) above, we have 
h(t(x)) = FEET ED) = FF" @))- 
Because f~! is the inverse of f, we have f(f7!(x)) = x, so h(E(x)) = x. 
We can do essentially the same thing for t(h(x)). We start with 
tnx) = gf (F(g@))))- 


Because f is the inverse of f 1, we have f—!(f(g(x))) = g(x), so t(h(x)) = g If *(f(9(x)))) = 9 (g(x) = x, as desired. 
Because h(#(x)) = t(h(x)) = x when h = f o g and t = g7! o f1, the function g7! o f~! is the inverse of f o g. O 


E Exercises E 


1.4.1 If f(x) = 44, what is the value of f~1(3)? 


x-2” 


1.4.2 Determine whether or not each of the following functions has an inverse. If the function has an inverse, 
find it. 


(a) f(x)=2x-7 (c) f(x) = V2 (e) f(x) =vV4—x4+vVx-2 
(b) f(x)= — 3 (d) f(x) =|5-2| (fx f(x) =x? — 6x +3, where x > 4 


1.4.3 Describe all ordered pairs (a,b) such that the inverse of f(x) = ax + b satisfies f(x) = f'(x) for all x. 


1.4.4 Let g(x) = ante where a, b, c, and d are positive and ad + bc. Which of į, £, and § cannot be in the domain of 


g1? Hints: 146 


1.5 Summary 


A function gives a single output for each input. The domain of a function consists of all the values we are able to 
input to the function and get an output. Meanwhile, the range of the function consists of all the values that can 
come out of the function. 


When we graph the function f, we graph the equation y = f(x) on the Cartesian plane. Therefore, if the point 
(a, b) is on the graph of y = f(x), then f(a) = b. Conversely, if f(a) = b, then (a,b) is on the graph of y = f(x). 


A curve on the Cartesian plane is the graph of some function if and only if every 
vertical line passes through no more than one point on the curve. We call this 


the vertical line test. 
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1.5. SUMMARY 


When k > 0, the graph of y = f(x) +k results from shifting the graph of y = f(x) 
vertically upward by k units, while the graph of y = f(x+k) results from shifting | 
the graph of y = f(x) horizontally to the left by k units. Similarly, the graph of 
y = f(x) — k is a k-unit downward shift of the graph of y = f(x) and the graph 
of y = f(x - k) is a k-unit rightward shift. 


The graph of y = —f(x) is the result of reflecting the graph of y = f(x) over the 
x-axis. The graph of y = f(—x) is the result of reflecting the graph of y = f(x) 
over the y-axis. í 


When k > 0, the graph of y= k f(x) results from scaling the graph of y = f(x). 
vertically by a factor of k, while the graph of y = f(kx) results from scaling the | 
graph of y = f(x) horizontally by a factor of 1/k. 


When we perform a composition of two functions, we place the output from the first function into the second. 
For example, both f(g(x)) and (f © g)(x) indicate that we take the output g(x) and input this result to f. We 
have a special notation for when we feed the output of a function back into the function itself. In this text, we 
write f?(x) to mean f(f(x)). Likewise, f”(x) refers to applying the function f(x) exactly n times; for example, 


PO = FEEFEE). 


Finally, we have a special name for a pair of functions such that each function “undoes” the action of the other: 


Definition: If f and g are functions such that 


g(f(x)) = x for all values of x in the domain of f, and 


f(g(x)) = x for all values of x in the domain of g, 


| then f is the inverse of g, and g is the inverse of f. | 


Not every function has an inverse. If a function gives the same output for two different inputs, then the 
function does not have an inverse. If there are no two different inputs to a function that produce the same output, 
then the function has an inverse. If f has an inverse g, then we can often find that inverse by solving the equation 


f(g(x)) = x for g(x). 
We usually denote the inverse of the function f as f~'. If f has an inverse, then the graph of f~! is the reflection 
of the graph of f over the line y = x. 


Important: A function has an inverse if and only if there does not exist a horizontal line 
i Vv that passes through more than one point on the graph of the function. 


Things To Watch Out For! 


e When evaluating a function that accepts multiple inputs, make sure you assign the values to the dummy 
variables in the correct order. 


e We must take into account any restrictions on the domain of a function when finding the range of the 
function. 


e Sometimes there is more than one constraint on the domain of a function. 
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e A common mistake is to think that the graph of y = f(x +k) is a rightward shift of the graph of y = f(x) when 


k is positive. This is incorrect; make sure you see why. 


e The graph of y = f(kx) is a horizontal scaling of the graph of y = f(x) by a factor of 1/k, not by k. So, the 
graph of y = f(3x) is the result of compressing the graph of y = f(x) towards the y-axis, not stretching it 


away from the y-axis. 


e Some sources use f*(x) to mean f(x) - f(x) instead of f(f(x)). Likewise, f"(x) is sometimes used to denote 
[f(x)]". You'll often have to determine by context whether f"(x) refers to a function applied n times, or to a 


function raised to the n power. 


e Don’t be intimidated by notation. Often the notation is a fancy way of representing very simple ideas. 


qy 


e When working with functions, f! does not mean 7! The expression f~! is a special notation that denotes 


the inverse of the function f. 


Problem Solving Strategies 


Concepts: e We can often check our answers in algebra problems by plugging the solu- 
tions we find back into the original question. 


_e Keep your eye on the ball. Sometimes we don’t need to evaluate every 
variable in a problem in order to solve the problem. 


[UC eeviewProstems ie 


1.23 Find the domain and range of each of the following functions: 


1 
(a) AG = Ax? +1 (c) P(x) = an 


(b) o(x) =3 + 4/16 - (x -3} (a) su) = a 


1.24 Find the domain of each of the following functions: 


9 -— 3x 


1 
(a) f(x) = Varna 
(b) N = : 
(me) = yix] -2 + yx 


1.25 Let f(x) = yap and g(x) = o ae Find f(g(x)). 


1.26 Let f(x) = ax? + bx +c and g(x) = ax? — bx + c. If f(1) = g(1) + 2 and f(2) = 2, find g(2). 
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REVIEW PROBLEMS 


1.27 At right is the graph of y = f(x). Graph each of the following: 


(a) y=f@-3) 
b) y=2f(x)+1 
(c) y= f(-x) 


(d) y= f(2-x) 
(e)x y= 4f(2x-1) +3 Hints: 249 


1.28 Suppose that f(f(x)) = g(f(x)) for all real x. Must f and g be the same 
function? 


1.29 For each of the following three graphs, determine if the graph could represent a function. If the graph can 
represent a function, then determine if the function has an inverse. If it does have an inverse, then graph the 
inverse. 


(a) 


1.309 Let (7,2) = = If f(a,b,c) = 1, what is the value of f(a, c, b)? 


1.31 Suppose that f(x) = —x* + bx + c and g(x) = dx + e, where b, c, d, and e are real constants. Is it possible that 
f(9(x)) = x°? Why or why not? 


1.32 Must the inverse of a linear function f(x) = ax + b, where a + 0, be a linear function? 
1.33 Suppose that a function f(x) is defined for all real x. How can you use the graph of f(x) to produce the 
graph of y = |f (x)|? 


1.34 Let f(t) = ES where t # 1. If y = f(x), then x can be expressed as which of the following: 


f(1/y), — fly), -f(-y), fy), fy? 


(Source: AHSME) 
cx 


1.35 The function f defined by f(x) = Eee 


(Source: AHSME) 


1.36 Suppose that f(x) has an inverse, f(x). Must f -1 (2x) be the inverse of f(2x)? If so, prove it. If not, find an 
expression that is the inverse of f (2x). 


satisfies f(f(x)) = x for all real numbers x except —3/2. Find c. 


1.37 The function f(x) = x? is not invertible. However, if we let g(x) = yx, then f(g(x)) = (yx)? = x. So, why isn’t 
f invertible? 
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ii Challenge Problems > 


1.38 Describe all constants a and b such that f(x) = = = and f(x) = f7'(x) for all x in the domain of f. 
1.39 Suppose that a function f(x) has domain (—1,1). Find the domains of the following functions: 
(a) f(x+1) Œ) f/x) O AVD (d)x f (24) 


1.40 Find the domain and range of the function f(x) = V2 — x — x*. Hints: 47 
1.41 Let f(x? +1) = x*+5x* +3. What is f(x? — 1)? (Source: AMC 12) Hints: 184 


1.42 Suppose that a function f(x) is defined for all real x. How can we obtain the graph of f(|x|) from the graph 
of f(x)? Hints: 271 


1.43 If f(f(x)) = x for all x for which f(x) is defined and f(0) = 2003, find all roots of the equation f(x) = 0. 


ax+b 
1.44 For which constants a, b, c, and d does the function f(x) = = rE have an inverse? 


1.45 Suppose that g(x) = f(x)f (lxi), where f(x) is a function with R as its domain. Must all x-intercepts of the 
graph of f be x-intercepts of the graph of g? Must all x-intercepts of the graph of g be x-intercepts of the graph 
of f? 


1.46 At right is the graph of y = f(x). Draw the graph of y = ; fsa 2) = 1 


1.47x The function f defined by 


where a, b, c, and d are nonzero real numbers, has the properties f(19) = 19, f(97) = 97, 
and f(f(x)) = x for all values of x except —d/c. Find the unique number that is not in 
the range of f. (Source: AIME) Hints: 185 
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There is nothing more difficult to take in hand, more perilous to conduct or more uncertain in its success than to take the lead 
it the introduction of a new order of things. — Nicoolo Machiavelli 


CHAPTER 


— 


Introduction to Trigonometric Functions 


Trigonometry is the study of triangles, but when most people refer to “trigonometry,” they speak of a special set 
of functions that are often useful when working with triangles. You may be familiar with some of these functions, 
such as sine, cosine, and tangent, from applying them to the acute angles of right triangles. In this text, we go 
well beyond right triangles, starting with a more complete definition of these trigonometric functions. 


Sidenote: Throughout this text, you’ll see some funny symbols used to represent angle 
measures. These funny symbols are lowercase Greek letters, which are often used 
to denote angle measures. Perhaps mathematicians use Greek letters this way 
because the Greeks contributed so much to geometry, or perhaps because the 
letters are so fun to write! Here’s the lowercase Greek alphabet: 


a alpha v nu 

B beta č xi 

y gamma o omicron 
E epsilon p rho 

C zeta o sigma 
n eta T tau 

0 theta v upsilon 
L iota o) phi 

K kappa x chi 

A lambda y psi 

u mu w omega 
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CHAPTER 2. INTRODUCTION TO TRIGONOMETRIC FUNCTIONS 


2.1 The Unit Circle 


We find the cosine, sine, and tangent of an angle @ by using a circle with 
radius 1 centered at the origin of the Cartesian plane. We call this circle the unit 
circle. We start by finding the point that results when the point (1,0) is rotated 
counterclockwise by @ about the origin. We call the resulting point the terminal 
point of 6. For example, point P in the diagram is the terminal point of 50°, since it 
is the point that results when (1, 0) is rotated 50° counterclockwise about the origin. 
Since the terminal point of an angle is the result of rotating about the origin a point 
that is 1 unit away from the origin, the terminal point of an angle is always on 
the unit circle. (In order to prevent diagrams from becoming too cluttered, we'll 
usually omit the (1, 0) that indicates that the circle has radius 1.) 


Once we have located the terminal point of 0, we define cos 0, sin 0, and tan @ as follows: 


Definition: The cosine of an angle @ is the x-coordinate of the terminal point of 0. We denote the cosine of 
8 as cos 8. 
The sine of an angle @ is the y-coordinate of the terminal point of 0. We denote the sine of @ as sin 8. 


The tangent of an angle @ is the ratio of the y-coordinate to the x-coordinate of the terminal point of 0. We 
denote the tangent of 0 as tan 8. 


pa n 


Putting these together, if (x, y) is the terminal point of 0, then Y 


cos 0 =x, 
(cos 0, sin 0) 


tanga Y =- me -a x 


As the diagram indicates, these definitions can be used for any angle 0, not 
just acute angles. 


One way to remember which coordinate is sine and which is cosine is to 
remember that it’s simply alphabetical order: cosine comes before sine, and x comes before y. To help remember 
that tan 8 is the ratio 4 rather than 7 we can think of the tangent of 0 as the slope of the line through the origin 
and the terminal point of @. 


Three other trigonometric functions are secant, cosecant, and cotangent, which are denoted as sec, csc, and 
cot, respectively. These are defined in terms of cosine, sine, and tangent as follows: 


1 
g= ? 
ii cos 0 
1 
csc 0 a 
1 
Wie _ COS 0 


In this section, we become familar with the trigonometric functions by evaluating them at various angles. Feel 
free to look back at the definitions as you work through the problems. You don’t need to memorize them right 
now; you'll eventually memorize them as you use them. 


Finally, as you've probably already realized, ’trigonometric” is a long word. Worse yet, it’s a mouthful when 
spoken, so “trigonometry” and “trigonometric” are often shortened as “trig.” 
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2.1. THE UNIT CIRCLE 


Problems 


Problem 2.1: 


(a) Triangle ABC below is a 45-45-90 triangle, which means one angle is a right angle and the other two 
angles are equal. If BC = 1, then what are AB and AC? 


E 


45° 


e 45° \ 3 yo 30° y 
Figure 2.1: Diagram for Part (a) Figure 2.2: Diagram for Part (b) 


(b) Triangle XYZ above is a 30-60-90 triangle, which means that its angles are 30°, 60°, and 90°. If YZ = 1, 
then what are XY and XZ? 


Problem 2.2: Find the cosine, sine, and tangent of each of the following angles: 
(a) 0 (c) 45° (e) 60° 
(b) 90° (d) 30° (f) 270° 


Problem 2.3: The trigonometric functions are often introduced as ratios of sides in X 
a right triangle. For example, in right triangle AXYZ at right, we have sin@ = %4 
cos @ = $4, and tan = 2. 
(a) Show that this definition is consistent with the unit circle definition. That is, 
show that if we compute sin 0, cos 0, and tan@ using the unit circle, where @ is 


ZX in right triangle AXYZ at right, then we get 


¢ YZ myx ge 
sin @ = Xz’ cos@ = Xz’ tan 0 = Yx 


(b) Why are the unit circle definitions more useful than the right triangle definitions? 


Problem 2.4: We can think of the coordinate axes as dividing the Cartesian plane into four 
quadrants, which are commonly labeled as shown at right. We say that an angle belongs to i I 
the quadrant in which its terminal point lies. So, for example, 120° is in quadrant II and 315° 

is in quadrant IV. We also say that 120° is a “second quadrant” angle and 315° is a “fourth 

quadrant” angle. (We typically don’t speak of quadrants when referring to an angle whose III IV 
terminal point is on one of the coordinate axes.) 


(a) In which quadrants is sine positive? 


(b) In which quadrants is tangent positive? 


Problem 2.5: Find each of the following: 


(a) cos120° (c) sec210° (e) csc 240° 
(b) sin 135° (d) cot315° (f) tan 330° 
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Problem 2.6: We can use the unit circle to evaluate trig functions of angles that are more than 360° or less than 
0°. If an angle is less than 0°, then we find its terminal point by going clockwise around the unit circle from (1,0), 
rather than counterclockwise. Find each of the following: 
(a) cos 1080° (c) cot(—45°) 

(b) sin 3720° (d) sec(-—480°) 


Problem 2.7: We typically write the squares of sinx and cosx as sin? x and cos? x, and do similarly for other 
positive powers of trig functions. Prove that for any angle 0, we have sin? 0 + cos? @ = 1. 


Problem 2.8: Find the range of each of the following functions: 
(a) f(f)=sint (b) g(t) = tant (c) h(t) = sect 


Problem 2.1: 
(a) Triangle ABC below is a 45-45-90 triangle, which means one angle is a right angle and the other two 
angles are equal. If BC = 1, then what are AB and AC? 


C Z 
| : 60° 
4. [e] 
| > ~ | Ts 
A 45° \ B x 30° y 
| Figure 2.3: Diagram for Part (a) Figure 2.4: Diagram for Part (b) 


(b) Triangle XYZ above is a 30-60-90 triangle, which means that its angles are 30°, 60°, and 90°. If YZ = 1, 
then what are XY and XZ? 


Solution for Problem 2.1: 


(a) We let AB = x. Since ZB = /C, we have AC = AB = x. Since BC = 1, the Pythagorean Theorem gives us 
x? +x* = 1, so x? = 1/2. Therefore, we have 


Both AB and AC have length ¥2/2. 


(b) The 60° and 30° angles makes us think of equilateral triangles. By reflecting Z 
AXYZ over side XY, we form equilateral triangle TYZ in the diagram at right. 
Since ATYZ is equilateral, we have TZ = ZY = 1. Since ATYX = AZYX, we 
have TX = XZ. Putting this together with TZ = 1 gives XZ = 1/2. Finally, the 
Pythagorean Theorem applied to AXYZ gives 


xy = Waa = 1-1 - ese 


4 2 
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If the hypotenuse of an isos 


celes right triangle has length 1, then each leg | 


VV has length ae If the acute angles of a right triangle are 30° and 60° and the 
hypotenuse has length 1, then the leg opposite the 30° angle has length £ and 


the leg opposite the 60° angle has length 28, These relationships are depicted 


below. 
2 > 1 o 1 | 
Ho 45° - 30° 
¥3/2 


V2/2 
Problem 2.2: Find the cosine, sine, and tangent of each of the following angles: 
(a) 0° (c) 45° (e) 60° 
(b) 90° (d) 30° (f) 270° 


E 
NIe 


Solution for Problem 2.2: 


(a) The point on the unit circle that is 0° counterclockwise from (1,0) is simply (1,0). Since the terminal point 
of 0° is (1,0), we have 


cos0° = 1, 
sin = 0, 
tan0° = 0. 


(b) The point on the unit circle that is 90° counterclockwise from (1,0) is (0,1). Since the terminal point of 90° 
is (0,1), we have cos 90° = 0 and sin 90° = 1. 


Since tan @ is defined as amg 
cos 0 


Important: If cos 0 = 0, then tan @ is undefined. 


, tan @ is undefined whenever cos @ = 0. 


Therefore, tan 90° is undefined, because cos 90° = 0. We can also see this in terms of the unit circle. The 
slope of the line from the origin to (0,1) is undefined, so tan 90° is undefined. Similarly, tan 0 is undefined 
whenever the terminal point of @ is on the y-axis (and therefore has x-coordinate 0, which means that cos 0 
is 0). 

(c) We first find the terminal point, P, of 45° by rotating the point (1,0) counter- 
clockwise an angle of 45°. To find the coordinates of P, we build right triangle 
OPS, as shown, and see that AOPS is a 45-45-90 right triangle with hypotenuse 


OP = 1. Therefore, we have OS = SP = ¥2, so point P is (32, ¥2) and: 


0 
2 
cos45° = v2 
2 
2 
sin 45° = v2 
2 
fan45° = 1 
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(d) As in the previous part, we locate the terminal point, P, and then build a right y 
triangle by drawing the altitude from P to the x-axis. Since AOPS is a 30-60-90 


triangle with OP = 1 and ZPOS = 30°, we have OS = 2 and PS = i, so point 


P is (= 1). Therefore, we have 


d PaA 
|_| 
ae way x 
cos 30° = —, 
P 
íl 
in30° = =, 
sin 5 


el TA 
T 


tan 30° = = — 


A B V3 V3 


P 


The last two parts show a common strategy for evaluating trigonometric functions of some angles: 


Important: To evaluate trigonometric functions of angles that are multiples of 30° or 45°, | 
Y we can locate the terminal point of the angle on the unit circle, and then build 


l a right triangle by drawing an altitude from the terminal point to the x-axis. 


(e) Once again, we build a right triangle after locating the terminal point. 
Since OP = 1 and ZPOS = 60°, we have OS = } and PS = 3. Therefore, 


Y 
IR 

point P is G, 8), and we have 

cos 60° = 

‘= 
x 
sin 60° = . 
5 V3/2 
tan 60° = 72 = v3. 


(f) The point (0, —1) is 270° counterclockwise from (1,0). Therefore, we have 
cos 270° = 0, 
sin270° = -1, 
tan 270° is undefined. 


A 


~x 


o 


This book may not be the first time you have encountered trigonometric functions. Trig functions are often 
introduced in geometry classes, where they are defined a little differently. 


| Problem 2.3: The trigonometric functions are often introduced as ratios of sides in X 
a right triangle. For example, in right triangle AXYZ at right, we have sin 0 = Xz, 
cos @ = %¥, and tan0 = %. 
(a) Show that this definition is consistent with the unit circle definition. That is, 
show that if we compute sin 0, cos 0, and tan 0 using the unit circle, where 0 is 


ZX in right triangle AXYZ above, then we get 


YZ 


tan0 = ——. 
an0 YX 


Yz 
XZ’ 


sin 0 = 


Why are the unit circle definitions more useful than the right triangle definitions? 


2.1. THE UNIT CIRCLE 


Solution for Problem 2.3: 


(a) We start by finding the terminal point of @ on the unit circle. From the 
definition of sin 0, we have sin @ = PA. To relate this to AXYZ in the problem, 
we note that ZX = ZPOA and ZY = ZPAO, so AXYZ ~ AOAP, which means 
that 3% = pô. Since PO = 1, we have %2 = PA. Therefore, the unit circle 
definition gives the same result for sin @ as the right triangle definition. 


Similarly, we have cos @ = OA and tan@ = awe From AXYZ ~ AOAP, we 


have ee = oA = OA and xe = oe so the right triangle definitions of cos 0 


and tan @ for acute angles 0 are consistent with their unit circle definitions. 


| Important: [f triangle XYZ is a right triangle with ¿XYZ = 90° and X 


W ZX = 0, then | 


| az YX YZ 
| sin 0 = y7, cos = yz, tanO = yy: 
| Another way of viewing this is: if 0 is the measure of | 

, ; Y Z 
| an acute angle in a right triangle, then we have 

ite 1 j ite | 

anoa opposite G Te adjacent lesi oa Oppesite aS 
| hypotenuse hypotenuse adjacent leg 


(b) The unit circle definition is much more powerful because it allows us to define trig functions for non-acute 
angles. 


Problem 2.4: We can think of the coordinate axes as dividing the Cartesian plane into four 
| quadrants, which are commonly labeled as shown at right. We say that an angle belongs II I 
to the quadrant in which its terminal point lies. So, for example, 120° is in quadrant II 

| and 315° is in quadrant IV. We also say that 120° is a “second quadrant” angle and 315° 

| is a “fourth quadrant” angle. (We typically don’t speak of quadrants when referring to an III IV 
angle whose terminal point is on one of the coordinate axes.) 


(a) In which quadrants is sine positive? 


(b) In which quadrants is tangent positive? 


Solution for Problem 2.4: 


(a) The sine of an angle is positive if and only if the y-coordinate of the angle’s terminal point is positive. 
Therefore, sine is positive in quadrants I and II, and negative in quadrants III and IV. 


(b) Since tangent is the quotient of sine and cosine, it is positive if and only if sine and cosine are both positive 
or both negative. Sine and cosine are both positive in the first quadrant, and both negative in the third 
quadrant (since both x and y are negative in the third quadrant). Therefore, tangent is positive in quadrants I 
and III. In quadrant II, the x-coordinate is negative while the y-coordinate is positive, so tangent is negative. 
In quadrant IV, the x-coordinate is positive while the y-coordinate is negative, so tangent is again negative. 


Oo 


You don’t have to memorize which trig functions are positive and which are negative in each quadrant. This 
information will come naturally if you understand the unit circle and the definitions of the trig functions. As 
practice, determine the sign of the answers in the following problems before finding the answers. 


a 
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| Problem 2.5: Find each of the following: 


(a) cos120° (d) cot315° 
(b) sin 135° j (e)  csc240° 
(c) sec210° (f)  tan330° 


Solution for Problem 2.5: 


(a) We first find the terminal point of 120°. Since 120° is between 90° and 180°, 
the terminal point of 120° is in the second quadrant, as indicated by point P. P 
is 120° — 90° = 30° beyond the y-axis, so when we draw altitude PS, we have 
ZPOS = 90° — 30° = 60°. Therefore, APOS is a 30-60-90 triangle with OS = į 


and PS = a8 . We then must be careful not make the to iowmg i mistake: 


Barus Solution: The A of P are (4, Dye so 


cos 120° = = 


The mistake here is that P is to the left of the y-axis. Therefore, while OS 

-3, not i. So, we have 

1 
cos 120° = -=. 

(b) Since 135° is between 90° and 180°, it is a second quadrant angle. The terminal 
point, P, is 180° — 135° = 45° clockwise from (—1,0), so drawing altitude PS 
forms a 45-45-90 right triangle, from which we find PS = SS Since P is above 
the x-axis, its y-coordinate is positive, and we have 


sin 135° = 22 
2 
(c) First, we note that 
1 
210° = ———_, 
Sgann cos 210° 


so we must find the x-coordinate of the terminal point, P, of 210°. P is 
210° — 180° = 30° counterclockwise from (—1,0), so it is in the third quad- 
rant. Drawing altitude PS to the x-axis forms 30-60-90 triangle POS with 
¿POS = 30°, so OS = a Since P is to the left of the y-axis, the x-coordinate 


of P is —OS, which is mC So, we have 


E 
seca Se O 


cos210° -43/2 V3 3 


= 3, the x-coordinate of P is 


Ir 
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(d) First, we note that 
1 


tan315°` 

Since 315° is 45° less than 360°, its terminal point, P, is just 45° less than a full 
counterclockwise revolution of (1,0) about the origin. Therefore, P is in the 
fourth quadrant, as shown, and ZPOS = 45°. This gives us OS = SP = =- 


Because P is to the right of the y-axis and below the x-axis, its coordinates are 
( 3 E 
a 


cot315° = 


J So, we have 


-ae 
T 
which gives us cot315° = 1/(tan 315°) = —1. 

(e) We have 


tan 315° = 


=í; 


z o 
i sin 240° 


so we seek the y-coordinate of the terminal point of 240°. Since 240° is between 
180° and 270°, it is a third quadrant angle whose terminal point is 240° —180° = 
60° counterclockwise from (—1, 0). In right triangle POS, we have ZPOS = 60°, 


DOS and PS = a Because P is below the x-axis, its y-coordinate is 
negative: 
1 il Da ANE 


240° = = 3 == 53 
PEA ae E © EG a EE 


(f) Since 330° is 30° less than 360°, the terminal point of 330° is 30° clockwise from 
(1,0), which puts it in the fourth quadrant with ZPOS = 30°, as shown. We 


therefore have OS = 2 and PS = Ł, so Pis (£, -1), and 


—1/2 1 3 
tan 330° = Bo oe 


Bn 6 


We can also use the unit circle to evaluate trig functions of angles that are more than 360° or less than 0°. If 
an angle is less than 0°, then we find its terminal point by going clockwise around the unit circle from (1, 0), rather 


than counterclockwise. 


Problem 2.6: Find each of the following: 
(a) cos 1080° (c)  cot(—45°) 
(b) sin 3720° (d) sec(—480°) 


Solution for Problem 2.6: 


(a) Going 1080° counterclockwise around the unit circle from (1,0) gets us to the same terminal point as going 
1080° — 360° = 720° counterclockwise, since 360° is simply one full revolution around the circle. Since 
720° = 2-360°, going 720° around the unit circle is two full revolutions, which brings us back to (1,0) as the 


terminal point. Therefore, we have: 


cos 1080° = cos720° = cos0° = 1. 
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We can similarly evaluate trig functions of any angle greater than 360° by subtracting multiples of 360° 
until we have an angle that is less than 360°. 

(b) Rather than repeatedly subtracting 360°, we notice that 3720° is slightly greater 
than 10 - 360°, so we subtract 3600°, which is the same as 10 full revolutions 
around the unit circle. Therefore, 3720° has the same terminal point as the 
angle 3720° — 10-360° = 120°. So, we have sin 3720° = sin 120°. In the diagram 
at right, we find the terminal point, P, of 120°, and form the 30-60-90 triangle 


POS with ZPOS = 60°. Since PS = a and P is above the x-axis, we have 


a = ein 120° = 2 


(c) Since the angle is negative, the terminal point is 45° clockwise from (1,0), as 
shown. From 45-45-90 triangle POS, we have PS = OS = a so the coordinates 


of P are (£, -#), which gives us 
_45° 5 
cot(—45°) = 1 _ cos(—45°) _ 2/2 be 


tan(—45°)  sin(—45°) — 2/2 _ 


(d) Going 480° clockwise around the unit circle brings us to the same terminal 
point as going 480° — 360° = 120° clockwise. This is the same as noting that 
the terminal point of —480° is the same as the terminal point of —480° + 360° = 
—120°. We find the terminal point by going 120° clockwise about the unit circle 
from (1,0), or by noting that the terminal point of —120° is the same as the 
terminal point of —120° + 360° = 240°. Either way, we have 30-60-90 triangle 
POS with ZPOS = 60°, from which we find OS = A Since P is to the left of the 
y-axis, its x-coordinate is negative, and we have 


1 1 
sec(—480°) = sec(—120°) = ———___ = —2. 


cos(—120°) —1/2 J 
O 


Understanding the unit circle not only allows us to evaluate trigonometric functions of some angles, but it also 
allows us to visualize and sometimes prove trigonometric identities. A trigonometric identity is an equation that 
is true for all angles for which the equation is defined. We now introduce the most common trig identity; we'll 
explore trig identities in much greater detail in Chapter 3. 


Problem 2.7: We typically write the squares of sin x and cos x as sin? x and cos? x, and do similarly for other 


positive powers of trig functions. Prove that for any angle 0, we have sin? 0 + cos? 0 = 1. 


Solution for Problem 2.7: By definition, the point (x, y) = (cos 0, sin @) is the point on the unit circle that is 0 degrees 
counterclockwise about the origin from (1,0). The unit circle is the graph of the equation x? + y? = 1. Since the 
point (cos 0, sin @) is on this graph, the ordered pair (x, y) = (cos 0, sin @) must satisfy the equation x? + y? = 1, 
which gives us cos? 0 + sin? O = 1. Reversing the addends on the left gives us the more commonly written form 
of this trigonometric identity: 
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Important: 


For any 0, we have sin? @ + cos? @ = 1. 


The identity sin? 0 + cos? @ = 1 isn’t really something new—it’s essentially the A 
Pythagorean Theorem. At right is a right triangle ABC with hypotenuse of length 1 
and ZBAC = @. Because sin@ = pe and AC = 1, we have BC = sin@. Similarly, 
cos 0 = $£, so AB = cos 0. The Pythagorean Theorem gives us cos 0 


BC? + AB? = ACÈ, J 
B sin 0 C 
sO 


sin? 0 + cos? @ = 1. 


WARNING!! In Section 1.3, we noted that P (x) usually means f(f(x)). Trigonometric 
4a 2 £ z : O | 
functions are a common exception to this. When we write sinf x, we mean 


(sin x)*, not sin(sin x). 


Problem 2.8: What is the range of each of the following functions: 
|© f= sint 

| ©) g(t) = tant 
| (c) A(t) = sect 


Solution for Problem 2.8: 


(a) Any point (x, y) on the Cartesian plane with a y-coordinate that is greater than 1 or less than —1 must be 
more than 1 unit from the origin (since ,/x? + y? > 1 if |y| > 1 and x is real). Therefore, no point on the unit 
circle can have y-coordinate greater than 1 or less than —1, which means —1 < sint < 1. Every value in the 
interval [—1,1] is in the range of sin t, since the line y = k intersects the unit circle for any value of k such 
that —1 < k < 1. So, the range of f(t) = sin t is [-1, 1]. Similarly, the range of cos t is also [—1, 1]. 


Important: For any angle 0, we have —1 < sin < land -1 < cos 0 <1. 


(b) The value of tan t is the slope of the line between the origin and the terminal point of the angle t. Therefore, 
for any real number m, we can find a value of t such that tant = m by constructing the line through the 
origin with slope m. This line must intersect the unit circle, and each point at which the line intersects the 
unit circle is the terminal point of an angle whose tangent is m. Therefore, the range of g(t) = tant is all real 
numbers. 


(c) First, we note that sec t = 1/(cos t), so we can find the range of sec t from the range of cos t, which is [-1, 1]. 
If we have 0 < k < 1, then the reciprocal of k is at least 1. Conversely, any number greater than or equal to 1 
is the reciprocal of a number in the range of cos t. Therefore, every number in the interval [1, +00) is in the 
range of h(t) = sect. Similarly, every number in the interval (—co, —1] is in the range of h. Finally, no number 
in the interval (—1, 1) is in the range of sect, since such a number being in the range of sec £ would mean 
that |cos t| > 1, which is impossible. Combining these observations, the range of sec t is (—o0, —1] U[1, +00). 
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2.1.1 Evaluate each of the following: 


(a) sin 120° (c) sec45° (e) cos180° 
(b) cot60° (d) tan210° (f) csc 300° 


2.1.2 Evaluate each of the following: 


(a) cos(—90°) (c) sin 750° 
(b)  tan36000° (d) sec(—1200°) 


2.1.3 Which of the following are positive: sin 293°, cos(—68°), cot 206°, csc 90.5°? 


2.1.4 For how many values of O such that 0 < 6 < 900° is cos 0 = 222 


2.1.5 What is the range of the function f(x) = 7 + 2sin3x? 


2.2 Radians 


There are many different units we can use to express distance. Our choice of units is usually determined by what 
We are measuring. For example, we wouldn’t measure the distance between two cities in centimeters or inches; 
we usually use kilometers or miles. Similarly, we use feet or meters for the height of a person instead of miles or 
kilometers. 


Just as with distance, there are different units for angles, and our choice of units is often determined by our 
purpose. In geometry, we typically use degrees, which we denote with the usual degree symbol, °. However, 
trigonometric functions of angles are a significant part of algebra, and in an algebraic context, degrees are not a 
convenient unit. The most convenient unit of measure for angles is the radian. 


To measure an angle in radians, we draw a circle with radius 1 centered at the vertex B A 
of the angle. The measure of the angle in radians equals the length of the arc cut off by the 


angle. So, in the diagram at right the measure of @ in radians equals the length of arc AB. 
As another example, the measure of a right angle in radians is + because the circumference 
of a circle with radius 1 is 27 and a right angle cuts off one-quarter of the circumference. 


While we introduced the idea of a different unit for angles by mentioning the many 
units available for length, there is a much more fundamental difference between degrees 
and radians than there is between inches and centimeters. The choice of what distance is 
“one inch” is an arbitrary human creation. Similarly, the choice of what angle measure is “one degree” is arbitrary 
because we could have chosen any number of degrees to be the number of degrees in a circle. 


With radians, the situation is different. By specifying the measure of an angle as the arc length cut off by the 
angle in a circle of radius 1, we are really defining a radian with a ratio. We could have equivalently defined 
the measure of an angle in radians as “the ratio of the length of the arc cut off by the angle to the radius.” This 
means that the radian is a dimensionless unit. By this, we mean that we do not use some human-defined angle 
as a reference to tell us what a radian is. For degrees, we need “a full circle is 360°” to tell us what a degree is, 
so the degree is not a dimensionless unit. For meters, we use the reference length “the distance traveled by light 
in a vacuum during 1/299792458 of a second,” so the meter is not a dimensionless unit. (No, we didn’t make 
that number up!) But to measure an angle in radians, we don’t need to compare the angle to any human-defined 
reference point, so the radian is a dimensionless unit. 
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Because the radian is a dimensionless unit, we usually use it instead of degrees when working with an angle 
measure in an algebraic context. For example, suppose the measure of an angle is x, and we must work in a 
problem with the quantity x°. Measuring the angle in degrees makes x? puzzling: what is a square degree? 
Measuring the angle in radians makes more sense; the measure is a number without any dimension, so squaring 
it is fine—that just gives us another number without any dimension. As we'll see later in the text, there are times 
that we wish to raise a number to a power that involves an angle measure. Raising a number to dimensionless 
number makes sense, but what is 2°i"hes? What units would such a result have? There’s no good answer to this 
question, so we generally don’t like to raise numbers to powers that are expressed in terms of a unit that is not 
dimensionless. 


Throughout this text, if we are evaluating a trigonometric function of a number that is without units, then the 
number should be assumed to be in radians, not in degrees. Therefore, sin1 does not mean the same thing as 
sin 1°. The expression sin 1 is the sine of 1 radian, not 1 degree. 


Problems ia 


Problem 2.9: Convert each of the following degree measures to radians: 
(a) 60° (Seale 
(b) 288° (d) 1400° 


Problem 2.10: Convert each of the following measures in radians to degrees: 
(a) 27/5 (c) -77/3 
(b) 47/3 (d) 1 


Problem 2.11: Evaluate each of the following: 


(a) cos 2 (c) 


(b) sin 7 (d) 


Problem 2.12: What is the sign of cot 30? 


| Problem 2.9: Convert each of the following degree measures to radians: 
(a) 60° @) Siler 
(b) 288° (d) 1400° 


Solution for Problem 2.9: 


(a) Since 60° is 60°/360° = 1/6 of a full circle, its equivalent in radians is (1/6)(27) = 7/3 radians. 

(b) Since 288° is 288°/360° = 4/5 of a full circle, its equivalent in radians is (4/5)(27) = 87/5 radians. 

(c) Since 315° is 315°/360° = 7/8 of a full circle, the equivalent of —315° in radians is (7/8)(—2m) = -77/4 
radians. 

(d) Since 1400° is 1400°/360° = 35/9 of a full circle, its equivalent in radians is (35/9)(27) = 707/9 radians. 


Rather than using the two-step process we used in our solutions above, we can use a conversion factor to do 
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both steps at once. Specifically, since 360° is equivalent to 27 radians, we can use the shorthand 


2n 


360° 

As an example, to convert 288° to radians, we multiply 288° by the conversion factor. This leaves the value 
unchanged because the conversion factor equals 1: 

27m7 _ 288° 


4 
om = 2 VS S12 = —, 
SS xa a 


Notice how the degrees “cancel out.” 


Converting radians to degrees is often even easier. Because 27 radians equals 360°, we know that 7 radians is 
equivalent to 180°. So, in an expression in radians that includes m, we can simply replace m with 180°. 


| Problem 2.10: Convert each of the following measures in radians to degrees: 


| (a) 2n/5 O -7n/3 


| b) 4n/3 (d) 1 


Solution for Problem 2.10: 


(a) Since 7 radians is 180°, we know that 27/5 radians equals 2(180°)/5, which equals 72°. 
(b) Since 7 radians is 180°, we know that 47/3 radians equals 4(180°)/3, which equals 240°. 
(c) As before, we replace m radians with 180°, so -77/3 radians equals —7(180°)/3, which equals —420°. 


(d) Ohno! We don’t have n in this expression. We don’t have to use n when expressing angles in terms 
of radians. Fortunately, we can still use our conversion factor approach by simply inverting our earlier 
conversion factor: 


360° 
=1 
27 
Now, when we multiply our angle in radians by this conversion factor, we are left with a measure in degrees: 
360° 180° 
1- ==, 
27 T 


Notice that we don’t sprinkle the word “radians” throughout this computation. Typically, we don’t write the 
units with angle measures expressed in radians, and any angle measure written without units is assumed 
to be in radians, not degrees. 


` Extra! Some common angle measures in radians are indicated in the regular polygons shown below. 
map n ah i 
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Problem 2.11: Evaluate each of the following: 


(a) cos 42 (c)  tan(—117) 
(b) sin? (d) sec 3# 


Solution for Problem 2.11: 


(a) We first convert radians to degrees. Since 7 radians is equivalent to 180°, we Y 
have P 


on 2(180°) > A 
cos = = cos = cos 120°. 
3 3 
Now, we can proceed as usual, by locating the terminal point of 120° on the NE 
unit circle as shown at right. From 30-60-90 triangle POS, we find OS = 1/2. 3 x 
Since P is to the left of the y-axis, we have cos 22 = cos 120° = —}. 
(b) Since m 5 Ae) = 420°, we have 


7 
sin = = sin 420° = sin(420° — 360°) = sin 60° = ae 
(c) Rather than converting to degrees first, we instead note that because a full revolution around the unit circle 
is 27 radians, we have tan(—117) = tan(—11z + 5-27) = tan(—7). Since a full revolution is 27, a half 
revolution is 7, so the terminal point of an angle of —7 radians is (—1,0). Therefore, sin(—z) = 0, which 
means 


sin(-71) _ 
cos(—7) ` 


tan(—117) = tan(—7) = 


(d) We could convert the angle to degrees, or we could note that the terminal point 
of 5 radians is (0,1) and the terminal point of z radians is (-1,0). Since a 
radians is the average of these two angles, its terminal point is the midpoint of 
the arc connecting these angles’ terminal points. In other words, its terminal 
point is the second quadrant point P shown in the diagram at right, in which 


ZPOS = 7 = 45°. From 45-45-90 APOS, we have OS = PS = L What’s wrong 

with this finish: 

| Bogus Solution: We therefore have 

pim 37 1 1 
sec — = —=— 


2 
4 sin 2/2 v2 


f 1 
The error here is that secx = ——, not ——. 
cosx sinx 


WARNING!! It’s very easy to confuse the definitions of sec x and csc x. Don’t be afraid to 
“S look them up when you encounter secant and cosecant. f 
Correcting this mistake, we have 


= z= Se SS SE SW 
ae cos — 2/2 v2 


Problem 2.12: What is the sign of cot30? 


Solution for Problem 2.12: What’s wrong with this solution: 
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Bogus Solution: | We have cot30 = cos 3 Since 30 is between 0 and 90, it is a first quadrant 


pia angle, which means that both cos 30 and sin30 are positive. Therefore, | 
cot 30 is also positive. 


The reasoning in this solution would be correct if the question had asked for the sign of cot 30°. However, the 
problem asks for cot 30, not cot 30°. 


WARNING! If an angle measure does not include the degree symbol or any indication of 
r 7 units, then the measure is assumed to be in radians, not degrees. So, if you 


are asked to find sin 30, the answer is not 1/2, since sin 30 refers to the sine 
of an angle of 30 radians, not 30 degrees. 


In order to find the sign of cot30, we must find the quadrant of an angle of 30 radians. We can take a bit 
of a shortcut by noticing that 7 is approximately 3.14, so 107, which has terminal point (1,0), is approximately 
31.4 radians. Therefore, an angle of 30 radians is approximately 31.4 — 30 = 1.4 radians clockwise from (1,0). 
(Remember, 30 radians is slightly less than 107 radians, so its terminal point will be less than 5 full revolutions 
counterclockwise from (1,0).) But how large is an angle of 1.4 radians? 


Again, we can estimate. Since 7 is approximately 3.14, we know that 5 radians, which is 90°, is approximately 
1.57 radians. Therefore, 1.4 radians is less than 90°. Since 30 radians is approximately 1.4 radians clockwise from 
(1,0), and 1.4 radians is less than 90°, we know that the terminal point of 30 radians is in the fourth quadrant. This 
means that cos 30 is positive and sin 30 is negative, so cot 30 is negative. O 


WARNING! When using a calculator or computer, make sure you know whether your 

“S device or program is using degrees or radians. If it assumes that the angle 
is in radians, and you enter “tan(30)” to find tan 30°, the answer will not be 
what you intend. One quick way to test it is to use it to compute sin 30°. If it 


returns something like —0.98803 instead of 0.5, then it is expecting the angles 
in radians, not degrees. 


PL > 
2.2.1 Convert each of the following degree measures to radians: 
(a) 330° (c) -315° 
(b) 270° (d) 2000° 
2.2.2 Convert each of the following measures in radians to degrees: 
(a) 3u (c) -77/5 
(b) 27/3 Gy = 


2.2.3 Draw a 30-60-90 triangle and a 45-45-90 triangle. In each triangle, label the angles with their measures in 
radians. Let the hypotenuse of each triangle be 1, and label the side lengths appropriately. 


2.2.4 Evaluate each of the following: 
(a) sing (c) tan 


(b) csc7n (d) cos (-2) 


2.2.5 What is the domain of the function f(x) = 3tan (2x - z) ? 
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2.2.6 Jake was taking a trigonometry test, and was asked to find tan 40°. He used his calculator to find the answer 
by typing in 40 and then hitting the tangent key. He dutifully wrote down what his calculator told him, —1.11721. 
He was very upset to have his answer marked incorrect, and was convinced that his calculator was broken. 


(a) Even without knowing what tan 40° equals, how should Jake have immediately known that his answer of 
—1.11721 was not the correct answer to the problem? 


(b) What mistake did he probably make? 


2.3 Graphs of Trigonometric Functions 


In this section, we study the graphs of trigonometric functions. Throughout our discussion of graphing trig 
functions, all angles will be expressed in radians. 


| Probiems Ba 


Problem 2.13: 
(a) Find the graph of y = sinx for0 < x < 27. 


(b) Use your answer to part (a) to construct the graph of y = sin x for all x. 


(c) A function f is called periodic if there is some nonzero constant t such that f(x) = f(x + t) for all values 
of x such that f(x) and f(x + t) are defined. The smallest possible positive value of t is called the period 
of f (if such a smallest value exists). Show that g(x) = sin x is periodic and find its period. 


| Problem 2.14: Graph y = cos x. What is the period of cos x? | 


Problem 2.15: What is the relationship between the graph of y = sin x and the graph of y = cosx? What does 
this suggest as a relationship between sine and cosine? 


Problem 2.16: h 
(a) The graph of y = sin x is symmetric about the origin. That is, if the graph is rotated 180° about the origin, 


the result is the same as the original graph. What does this tell us about the relationship between sin x 
and sin(—x)? 

(b) What symmetry does the graph of y = cosx exhibit, and what does this symmetry tell us about the 
relationship between cosx and cos(—x)? 


Problem 2.17: 
(a) What is the period of tan x? 
(b) Graph y = tanx. 
(c) We call a line an asymptote of a graph if the graph gets closer and closer to the line, but does not intersect 
it, as x or y gets far from 0. What lines are asymptotes of the graph of y = tanx? 


Problem 2.18: 
(a) Graph y = csex. 


(b) What is the period of csc x? 

(c) What lines are asymptotes of csc x? 

(d) At what points do the graphs of y = sin x and y = csc x intersect? 
(e) At what points do the graphs of y = cos x and y = csc x intersect? 
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Problem 2.19: What is the number of solutions of the equation z = sinx? (Source: AMC) 


Problem 2.13: 
(a) Find the graph of y = sin x for0 < x < 27. 
(b) Use your answer to part (a) to construct the graph of y = sin x for all x. 


(c) A function f is called periodic if there is some nonzero constant t such that f(x) = f(x + t) for all values 
of x such that f(x) and f(x + t) are defined. The smallest possible positive value of t is called the period 
of f (if such a smallest value exists). Show that g(x) = sin x is periodic and find its period. 


Solution for Problem 2.13: 


(a) We'll produce the graph of y = sin x by choosing a variety of angles 0 and evaluating sin 0 for those angles. 
We'll then plot the points (0, sin 0), and connect them to build our graph. We use @ here rather than x 
because we want to avoid confusion when speaking of x-coordinates of points on the unit circle. 


We start by choosing values of @ for which we can compute sin @ easily. We build a table as shown 
below, and use the table to shape the corresponding graph, which is shown at right below. 


We can also get an intuition for the shape of the graph by thinking about the unit circle. The y-coordinate 
of the terminal point of 0 is sin 0, so we track how the y-coordinate of the terminal point of O changes as 0 
ranges from 0 to 27. As 0 increases from 0 to 7/2, the y-coordinate of the terminal point of 0 increases from 
0 to 1. Moreover, this y-coordinate increases more quickly when @ is near 0 than when @ is near 7/2. This 
corresponds to the way the graph above increases quickly near 0 = 0, but the pace of this increasing slows 
as 0 nears 7/2. Continuing past 0 = 7/2, the y-coordinate of the terminal point decreases from 1 to 0 as 0 
increases from 7/2 to n. This corresponds to the downward trajectory of our graph from 0 = 7/2 to 0 = n. 


Then, @ continues from 7 to 37/2, and the y-coordinate of its terminal point goes from 0 to —1. Finally, 
0 goes from 37/2 to 27, during which time the y-coordinate of its terminal point goes from —1 to 0. So, as 
sin 0 goes from 0 to —1 and back to 0 while 0 goes from 7 to 27, we get the curve below the horizontal axis 
in the graph of above. 


(b) We find the rest of the graph by continuing with the unit circle. The value of sin 8 is the y-coordinate of the 
terminal point of the angle 0 on the unit circle. The angle @ has the same terminal point as the angle 6 + 27, 
and as the angle 0 + 47, and as the angle 6 + 6r, and so on. Similarly, it has the same terminal point as the 
angles 0 — 27, 0 — 47, and so on. So, we have 


+++ = sin(0 — 47) = sin(@ — 27) = sin 0 = sin(0 + 27) = sin(@+ 47) =.. 


In other words, the value of sin @ stays the same whenever we increase or decrease 0 by 2n. This means 
the vertical coordinate of a point on the graph stays the same each time we increase or decrease @ by 27. In 


46 


2.3. GRAPHS OF TRIGONOMETRIC FUNCTIONS 


(c) 


o 


other words, the graph repeats itself over and over in both directions forever, giving us the graph below. 


sin 0 


We can now replace “0” and “sin 0” in the graph above to produce the graph of y = sin x. We will also 
typically remove the 1 and —1 on the vertical axis to reduce the clutter of the graph, as shown below. 


Figure 2.5: Graph of y = sinx 


The only difference between this and our previous graph is the labeling of the axes, where we labeled 


the axes of our first graph with 0 and sin @ to allow use to use “x” and “y” in discussing the unit circle in 
our solution. 


As we explained in the previous part, we have 
sin x = sin(x + 272) 


for all values of x. Therefore, g(x) = sin x is indeed a periodic function, and the fact that sin x = sin(x + 272) 
for all x tells us that its period is no more than 27. We can see from the graph that the period is no smaller 
than 27, but to show this algebraically, we consider the points at which sinx = 1. The only angles x for 
which sin x = 1 are those with terminal point (0,1). The smallest such positive angle is x = 5 and the next 
smallest occurs a full revolution around the unit circle later, when x = =. These two angles for which 
sin x = 1 are 27 apart, and there are no angles x between % and ¥ such that sin x = 1. Therefore, the period 
of sin x cannot be less than 27. 


We’ve now shown that sinx = sin(x + 27) for all x, and that there is no positive value of t smaller than 
t = 27 such that sin x = sin(x + t) for all x, so we conclude that the period of g(x) = sin x is 27. 

If a function is periodic with period t, we sometimes refer to any interval of length ¢ as a “period” of the 
function. So, for example, [0, 27t] is one period of sin x, as are (27, 4r], [6,6 + 27], and any other interval of 
length 27. 


| Problem 2.14: Graph y = cos x. What is the period of cos x? | 


Solution for Problem 2.14: Just as we did with the graph of sin x, we will consider the unit circle to develop the 
graph of y = cos x. As before, we'll use 8 as the angle, to avoid confusion as we refer to the unit circle. 


47 


CHAPTER 2. INTRODUCTION TO TRIGONOMETRIC FUNCTIONS 


For starters, we can consider the unit circle to see that 
cos 0 = cos(@ + 27) 


for all values of 0, because the terminal point of 8 is the same as the terminal point of 0 + 27. Therefore, the graph 
of cos 0 repeats itself horizontally every 27. 


To see how the graph behaves from 0 = 0 to 0 = 27, we consider the unit circle. When 0 = 0, the terminal 
point of @ is (1,0), so cos@ = 1 for O = 0. As 0 increases from 0 to 3, the value of cos @ decreases from 1 to 0, 
which corresponds to the fact that the x-coordinate of the terminal point of 0 goes from 1 to 0 as 0 goes from 0 to 
=. Continuing around the unit circle, as 0 goes from § to 7, the terminal point of 0 goes from (0, 1) to (—1, 0). The 
x-coordinate, and therefore the value of cos 0, goes from 0 to —1. This is illustrated in the graph as the graph goes 
below the horizontal axis when 0 increases past 4. The graph reaches its lowest point at 0 = z, where cos 0 = —1, 
and then goes upward as 0 increases past nz. The resulting graph is shown below. 


cos 0 


Back on the unit circle, as 0 goes from n to 34, the x-coordinate (which is cos 0) of the terminal point goes 
from —1 to 0. Correspondingly, the graph rises as 0 goes from n to 34. The graph intersects the horizontal axis at 
@ = *. It then continues upward, reflecting the fact that the x-coordinate of a fourth quadrant angle’s terminal 
point is positive. The graph again reaches cos 0 = 1 when we complete our revolution about the the unit circle, 
which is when 0 = 2r. We're then back where we started on the unit circle, so the graph as 0 increases from 27 is 
the same as the graph is as 0 increases from 0. Therefore, cosine is periodic with period 27. 


We can replace the horizontal and vertical labels of our graph above to give the graph of y = cos x: 


Figure 2.6: Graph of y = cos x 


o 


The shape of the graphs of sine and cosine is sometimes referred to as a sinusoid or sine curve. You might 
have noticed that these two graphs look very similar. 


| Problem 2.15: What is the relationship between the graph of y = sin x and the graph of y = cos x? What does 
this suggest as a relationship between sine and cosine? 
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ee for Problem 2.15: We graph both y = sinx (solid) and y = cosx (dashed) on the same Cartesian plane 
elow. 


Figure 2.7: Graphs of y = sin x (solid) and y = cos x (dashed) 


The graph of y = cosx appears to be a leftward shift of the graph of y = sinx. This suggests an interesting 
relationship between sine and cosine. Specifically, we let f(x) = sinx. In Section 1.2, we saw that shifting the 


graph of y = f(x) leftward by 5 gives us the graph of y = f (x + z). Our graphs above suggest that this shift 


produces the graph of y = cosx. So, it appears that f (x + z) = cosx for all values of x. That is, it looks like we 
have a new trigonometric identity: 


; TU 
sin. (x + =) = COS X. 


We'll learn how to prove this identity, and many more, in Chapter 3. O 


| Problem 2.16: 
| (a) The graph of y = sinx is symmetric about the origin. That is, if the graph is rotated 180° about the 
| origin, the result is the same as the original graph. What does this tell us about the relationship between 
| sin x and sin(—x)? 

| (b) What symmetry does the graph of y = cosx exhibit, and what does this symmetry tell us about the 
| relationship between cos x and cos(—x)? 


Solution for Problem 2.16: 


(a) Ifa point P is rotated 180° about point O to produce point P’, then O is the midpoint of PP’. Therefore, if 
(a, b) is rotated 180° about (0,0), the result must be (—a, —b), since (0,0) is the midpoint of the segment with 
endpoints (a, b) and (—a, —b). 


Since the graph of y = sin x is symmetric about the origin, we know that for every point (a, b) on the graph 
of y = sin x, the point (~-a, —b) is also on the graph of y = sinx. Because (a, b) and (—a, —b) are on the graph 
of y = sinx, we have b = sina and —b = sin(—a). Putting these together gives us sin(—a) = —b = —sina. 
Therefore, the fact that the graph of y = sin x is symmetric about the origin tells us that for all x, we have 


sin(—x) = —sinx. 
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A function f such that f(-x) = —f(x) for all x in the domain of f is called an odd function. Therefore, 
sine is an odd function. 
(b) The graph of y = cosx is symmetric about the y-axis. This means that for every point (a,b) on the graph, its 
mirror image over the y-axis, which is (—a,b), is also on the graph. Therefore, we have both b = cosa and 
b = cos(—a), so we have cosa = cos(—a) for all a. 


A function f such that f(-x) = f(x) for all x in the domain of f is called an even function. Because 
cos(—x) = cosx 


for all x, cosine is an even function. 


We can prove the relationships suggested in Problem 2.16 by using the unit circle. The 
actions of rotating the point (1,0) an angle 0 counterclockwise and rotating (1,0) an angle 
0 clockwise are symmetric about the x-axis. That is, if P is the result of rotating (1,0) an 
angle 0 counterclockwise about the origin and Q is the result of rotating (1,0) an angle 
0 clockwise about the origin, then P and Q are reflections of each other over the x-axis. 
Therefore, P and Q have the same x-coordinate, but opposite y-coordinates, which means 
that cos(—@) = cos 0 and sin(—@) = — sin @. 


cos(—@) = cos 0, 
sin(—@) = - sin 8. 


Problem 2.17: 
(a) What is the period of tan x? 


(b) Graph y = tanx. 


(c) Wecalla line an asymptote of a graph if the graph gets closer and closer to the line, but does not intersect 
it, as x or y gets far from 0. What lines are asymptotes of the graph of y = tan x? 


Solution for Problem 2.17: 


(a) The period of sin x is 2x and the period of cosx is 27, so we might guess that the period of tan x is 27 as 
well. After all, since sin(x + 271) = sin x and cos(x + 27) = cos x, we have 


Sasa , 
ane sin(x +27) _ sinx 


cos(x + 2r) cosx 


While this does tell us that tan x is periodic, and that the period is no greater than 27, it doesn’t tell us that 
the period is exactly 27, since the period could still be smaller than 27. 
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We turn to the unit circle to determine the period of tanx. We defined tan @ as the slope of the line 
through the origin and the terminal point of 0. As @ ranges from 0 to 3, this slope starts at 0 and increases 
without bound, without ever being the same for two different values of 0. Similarly, as 0 goes from 0 
down to oe this slope starts at 0 and decreases without bound, again without ever being the same for two 
different values of 6. Moreover, the slope is 0 when @ = 0, negative when 0 > 0 > —+, and positive when 
0 < 0 < 3, so there are no two values of @ from —5 to ï that have the same tangent. This means that the 
period of tangent cannot be less than 7. 


But can the period of tangent equal n? Again, the unit circle holds the y 
answer. For any m, the line through the origin with slope m intersects the unit 
circle at two points that are diametrically opposite each other. Therefore, they 
are the terminal points of two angles that differ by z. Moreover, because the E 
same line through the origin passes through both terminal points, the tangents (es X 
of these two angles are the same. Therefore, we have tan(0 + 7) = tan 0 for all 


xX 
0 for which tan @ is defined, so the period of tangent is 7. w 
(b) We did all the hard work in the previous part. Since tanx has period m, we 
can graph y = tan x for a single period, and then just copy it over and over to 


get the graph of y = tanx. We’ll choose to graph tanx first for -f < x < 3, 
since tan x is defined for all values of x in this range. 


As discussed in part (a), we have tan 0 = 0, and tan x increases without bound as x goes from 0 to 5 (but 
tan x is not defined for x = 5, so there are no points on the graph with this value of x). So, as x goes from 0 
to Z, the graph of y = tan x starts at the origin and goes upward forever as x approaches >. 


Figure 2.8: Graph of y = tanx 


The graph does not intersect the line x = 5, because tanx is not defined when x = 3. However, the 
graph does approach this line without ever reaching it. The line x = 3 is called an asymptote of the graph 
of y = tanx. 

Moving over to the other side of the origin, we can follow the same reasoning as above to build the 
graph as x goes from 0 to —§. Since tan x decreases without bound as x goes from 0 to — 4, the graph starts 
at (0,0) and goes downward forever as x approaches -3. Moreover, this portion of the graph is the 180° 


rotation about the origin of the portion for0<x< a because 


sin(—x) _ -sinx 


= = — tanx. 
cos(—x) cosx 


tan(—x) = 


Therefore, tan x is an odd function, and its graph is symmetric about the origin. Just as x = 5 is an asymptote 
of the graph, so is x = =5- 
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Now that we have graphed one full period of tanx, we can repeat this graph indefinitely in both 
directions to produce the graph of y = tan x, as shown above. 


(c) Since the graph repeats in intervals of 71, its asymptotes repeat in intervals of n. We saw above that x = 7 is 


an asymptote of the graph, so other asymptotes are x = 32, x = x = 4 etc., as well as x = —}, x = -%, 
x = —%, etc. We can express these all at once: the asymptotes are x = “ for all odd integers n. Note that 


these values correspond to the values of x for which cos x = 0, which means tan x is not defined. 


O 


Problem 2.18: 
(a) Graph y = csex. 


(b) What is the period of csc x? 
(c) What lines are asymptotes of csc x? 


(d) At what points do the graphs of y = sin x and y = csc x intersect? 


(e) At what points do the graphs of y = cos x and y = csc x intersect? 


Solution for Problem 2.18: 


(a) Since sin0 = 0, csc 0 is undefined, so the graph of y = cscx does not intersect the y-axis. When x is a very 
small positive number, sin x is very small, so csc x is very large. Similarly, when x is the the negative of 
a very small number, sin x is negative, but very close to 0, which makes cscx the negative of a very large 
number. So, just to the right of the y-axis, the graph of y = cscx is very far above the x-axis, while just to 
the left of the y-axis, the graph is very far below the x-axis. 


As x increases from 0 to +, sin x goes from 0 up to 1. As sin x increases, ais decreases until it reaches 1 at 
x = 4. Then, as x goes from 5 up to 7, sin x goes from 1 back down to 0, and csc x increases without bound, 
starting from csc x = 1 at x = 5. Just as the graph does not intersect the line x = 0, it does not intersect x = 71. 


Figure 2.9: Graph of y = csc x 


On the other side of the y-axis, the pattern is repeated, but all the action is below the x-axis. As x goes 
from 0 to —3, sin x goes from 0 to —1, so the graph of y = csc x goes from very far below the x-axis, up to the 


point (-2, -1). Then, as x goes from —4 to —7, sinx goes from —1 back to 0. So, the graph of y = csc x goes 
back down from (-3, -1), continuing indefinitely below the x-axis as the graph approaches, but never 
reaches, the line x = =r. 
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But what happens beyond x = 7 and x = —7? Fortunately, we don’t have to try to figure that out from 
scratch. Since sin x is periodic with period 27, we know that =< must also repeat itself in intervals of 27. 
Since we have already determined the graph from x = —7 to x = n, we can simply repeat this portion of the 
graph every 27 to produce the full graph of y = csc x shown earlier. 


(b) As discussed in the previous part, since the period of sinx is 27, we know that the period of ay is also 271. 


(c) Since sinx = 0 whenever x is an integer multiple of 7, csc x is undefined when x is an integer multiple of 
n. However, x can take on values very close to these integer multiples of n. At these values of x, the graph 
of y = cscx is very far from the x-axis. We therefore see that the lines x = nn, where n is an integer, are 
asymptotes of the graph of y = csc x. 


(d) Ifthe graphs of y = sin x and y = csc x intersect, then we must have sin x = csc x. We therefore must have 
sinx = 1/(sin x), or sin? x = 1, which means sinx = 1 or sinx = —1. This occurs when x = > for all odd 
integers n, which tells us that our graphs intersect at all these values of x. We can see this in the graph 
below, in which y = sin x is graphed as a dashed curve, and y = csc x is graphed as a solid curve. 


Y 


Figure 2.10: Graphs of y = csc x (solid) and y = sin x (dashed) 


(e) Ify = cosx and y = csc x intersect, then we must have cos x = csc x, so cos x = =r z- Therefore, we must have 


(cos x)(sinx) = 1. However, neither cos x nor sin x can exceed 1 or be less than —1. So, if (cos x)(sin x) = 1, 
then we must have cos x = sin x = 1 or cosx = sin x = —1. But both of these are impossible, since sin x = 0 
whenever cos x is —1 or 1. Therefore, the two graphs do not intersect. 


We also could have seen that the graphs do not intersect by noting that the graph of y = csc x never goes 
between the lines y = 1 and y = —1, and the graph of y = cosx never goes outside them. Therefore, the 
graphs can only intersect on one of these lines. The graph of y = cos x only meets these lines when x = nn 
for integers n, but the graph of y = csc x does not have any points for which x is an integer multiple of 7, 
because csc x is undefined for these values of x. Therefore, the graphs do not intersect. 


Problem 2.19: What is the number of solutions of the equation 795 = sinx? (Source: AMC) 


Solution for Problem 2.19: We can’t isolate x or factor this equation in any useful way. We have one obvious solution 
at x = 0, which makes both sides equal to 0, but there aren’t any other obvious solutions. Maybe there are no other 
solutions? 


To get some intuition for the problem, we try graphing both sides of the equation. That is, we graph y = a 
and y = sinx. The intersection points of these graphs correspond to solutions of the equation T = sinx. 


53 


CHAPTER 2. INTRODUCTION TO TRIGONOMETRIC FUNCTIONS 


Figure 2.11: Graphs of y = sin x (solid) and y = 345 (dashed) 


Aha! There are many more solutions. If fact, we see that for each period of sin x, a line that is between y = 0 
and y = 1 will intersect the graph of y = sin x twice—once as sin x goes up from 0 to 1, and then again as sin x goes 
down from 1 to 0. The same is true for any line that is between y = 0 and y = —1 for a full period of sin x. 


To count the number of intersections, we first note that both graphs are symmetric about the origin. So, 
counting the intersections on the right of the y-axis gives us the number of intersections on the left of the y-axis, 
as well. We also have to remember the origin, and add that in at the end. 


WARNING!! Always be on the lookout for special cases. 0 is a common special case. 


In counting the intersections to the right of the y-axis, we note that the period from 0 to 27 has only 1 intersection 
with the line y = 75. After that, each period of 27 has two intersections. For example, we can see that the dashed 
line intersects the sine curve just past x = 27 and just before x = 3n. The line y = 775 will exceed y = 1 when 
x > 100, after which the line will not intersect the graph of y = sinx anymore. So, now we just have to count the 
number of periods up to x = 100. 


To count the number of periods up to x = 100, we note that 
307 ~ 94.2 and 327 ~ 100.48. So, we know that we complete 
15 full periods and then most of another one as x goes from 0 On 317 Om a 
to 100. But how do we know if the line intersects the graph of 
y = sin x twice during the period from x = 307 to x = 327? The diagram above shows the portion of both graphs 
for values of x from a little less than 307 to a little more than 327. This picture isn’t terribly illuminating. It’s 
not even obvious from the diagram that graphs don’t intersect for x > 327. We have to use our observation that 
327 > 100 to be sure that an > sin x for all x > za, since D > 1 > sinx for x > 327. Perhaps we can come up with 
a similar argument to see that the graphs intersect twice when 307 < x < 317. 


We consider where in the period the line meets the sine curve. The sine curve is above the x-axis in the first half 
of each period (from x = 0 to x = 7, from x = 27 to x = 37, etc), so it is above the x-axis from x = 307 to x = 317. 
Since the line is above the x-axis when x > 0, both of its intersections with y = sin x in the interval 30r < x < 327 
must be in the range 307 < x < 31m. Now, we see the line does indeed intersect this final period twice, because 
31m ~ 97.34. 


There is one subtle point the previous paragraph overlooks. We have shown that if the graph of the line 
intersects the period of sinx from x = 307 to x = 327, then it does so between x = 307 and x = 31r, but we 
haven’t shown that it’s impossible for the line to be tangent to the graph of y = sinx, so that there is only one 
intersection point rather than two. To see why there are at least two intersection points, we note that sin 30.57 = 1, 
and (30.57)/100 = 0.9582, which is less than 1. Since sin 307 = sin31r = 0, we see that the graph of y = 75 is 
above the graph of y = sin x when x = 307, below it when x = 30.57, and above it again when x = 317. Therefore, 
the line intersects the graph of y = sin x at more than 1 point for 307 < x < 327. (We need more advanced tools to 


give a rigorous explanation why the line doesn’t intersect y = sin x at more than two points for 307 < x < 327.) 


We're now ready to count all the intersections. To the right of the y-axis, the line intersects the sine curve 1 time 
in the first period, and 2 times in each of the next 15 periods, for a total of 1+ 2-15 = 31 intersections. Both graphs 
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are symmetric about the origin, so they intersect 31 times to the left of the y-axis, too. Adding in the intersection 
at the origin gives a total of 31 + 31 + 1 = 63 intersections, so there are 63 solutions to the equation 779 = sinx. O 


Concept: Graphs can help provide intuition for problems involving trig functions. — 


|| __Exercises M 


(a) Graph y = secx. 
(b) What is the period of sec x? 
(c) What is the relationship between the graph of y = sec x and the graph of y = csc x? 


2392 


(a) Graph y = cotx. 
(b) Whatis the period of cot x? 


2.3.3 At how many points with x-coordinate between 0 and 100 do the graphs of y = sin x and y = cos x intersect? 


2.4 Transformations of Trig Functions 


For the graphing problems in this section, try using the methods we discussed in Section 1.2 for transforming 
graphs of functions. After producing each graph, test your answer by trying various values of x, finding the 
corresponding value of y, and checking that the point (x, y) is indeed on the graph you produced. 


[Problems E 
Problem 2.20: Graph y = 2sinx and y = sinx + 2. 


Problem 2.21: 


(a) Graph y = cos 2x. 
(b) What is the period of cos 2x? 
(c) Whatis the period of cosax, where a is a positive constant? 


Problem 2.22: What is the period of sin (-3)? 


Problem 2.23: Is f(x) = sin ¥ + cos 2x periodic? If so, what is the period? 


Problem 2.24: The phase shift of a function of the form f(x) = asin(bx + c), where a, b, and c are constants, is the 
smallest horizontal distance by which the graph of g(x) = a sin bx must be shifted to produce the graph of f. We 
typically use positive values for the phase shift if this shift is to the right, and negative if the shift is to the left. 


(a) What is the phase shift of sin (x _ z) ? 


(b) Graph y = sin (x — z). 
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Problem 2.25: Graph y = sin x + sin(x — n). What does the graph suggest about the relationship between sin x 
and sin(x — 71)? 


Problem 2.26: 
(a) What is the phase shift of ~2 sin (3x — 4)? (Note: The answer is not $!) Hints: 62 
(b) What is the period of —2 sin (ax — z) 2 


(c) Graph y = —2sin (3x — z). 


Problem 2.27: A Ferris wheel is a ride that consists of seats connected to the circumference of a large vertical 
wheel. As the wheel rotates, the riders are moved from the bottom of the wheel to the top of the wheel, and 
then back to the bottom. When you first get on a particular Ferris wheel, you are at the bottom of the ride, 3 feet 
off the ground. Once the Ferris wheel starts, it rotates around its center at a constant rate. Suppose that it takes 
you 8 seconds to reach the highest point of the ride, 63 feet off the ground. 
(a) You start at the bottom of the ride. How high off the ground will you be 5 seconds after the ride starts? 
(Answer to the nearest tenth of a foot.) 


(b) Find a function f such that f(t) equals how high in feet you will be off the ground t seconds later. 


Solution for Problem 2.20: We can produce the graph of y = 2 sin x by scaling the graph of y = sin x vertically away 
from the x-axis by a factor of 2. The resulting graph is shown below: 


Figure 2.12: Graphs of y = 2sinx (solid) and y = sin x (dashed) 


The only difference between the graph of y = 2sinx and the graph of y = sinx is the vertical scale. We use 
amplitude as a measure of this vertical scale, where the amplitude equals half the vertical distance between a 
maximal (highest) and minimal (lowest) point of the graph. Equivalently, we can say that the amplitude of a sine 
(or cosine) function is half the difference between the maximum and minimum values of the function. So, the 
amplitude of sin x is 1, and the amplitude of 2 sin x is 2. 


We produce the graph of y = sin x + 2 simply by shifting the graph of y = sin x up by 2 units, as shown atop 
the following page. 
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Figure 2.13: Graphs of y = sin x + 2 (solid) and y = sin x (dashed) 


Notice that shifting the graph vertically does not affect the vertical distance between a maximal (highest) and 
minimal (lowest) point on the graph. Therefore, the amplitude of the graph of y = sinx + 2 is the same as the 
amplitude of the graph of y = sin x, which is 1. O 


As an Exercise, you'll show that the amplitude of asin x, where a is a constant, is Jal. 
Problem 2.21: 
(a) Graph y = cos 2x. 
(b) What is the period of cos 2x? 


(c) What is the period of cosax, where a is a positive constant? 


Solution for Problem 2.21: 


(a) In Section 1.2, we learned that the graph of y = f(2x) is a horizontal scaling of the graph of y = f(x) towards 
the y-axis by a factor of 1/2. So, to produce the graph of y = cos 2x, we scale the graph of y = cos x towards 
the y-axis by a factor of 1/2, producing the graph below (the scaling of three specific points on the graph of 
y = cos x is indicated by the grey arrows): 


Figure 2.14: Graphs of y = cos 2x (solid) and y = cos x (dashed) 


(b) The graph of y = cosx repeats every 27. Since we compress this graph horizontally by a factor of 1/2 to 
produce the graph of y = cos 2x, the graph of y = cos 2x repeats every 2n(1/2) = n. Therefore, the period of 
cos 2x is T. 

(c) The graph of y = cosax results from compressing the graph of y = cosx horizontally by a factor of 1/a. 
Since the graph of y = cos x repeats every 27, the compressed graph of y = cosax will repeat every 271/a. 
Therefore, the period of cos ax is 27/a. 


EE EE ao 
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Problem 2.22: What is the period of sin (-2) 2 


Solution for Problem 2.22: We first deal with the negative sign. Since the graph of y = f(-x) is the reflection of 
the graph of y = f(x) over the y-axis (see page 14), the graph of y = sin(-x) is the reflection of the graph of 
y = sinx over the y-axis. Reflecting the graph of a periodic function over the y-axis doesn’t have any effect on 
how frequently the graph repeats, so the period of sin(—x) is the same as the period of sin x, which is 27. 


Next, we deal with the fraction. The graph of y = f(x/3) is a horizontal stretching by a factor of 1/(1/3) = 3 
away from the x-axis. Therefore, the graph of y = sin (=z) results from stretching the graph of y = sin(—x) 
horizontally by a factor of 3. Since the graph of y = sin(—x) repeats every 2r, the graph of y = sin (=z) repeats 
every 3(27) = 67. Therefore, sin (-5) has period 67. O 


We ec follow the same reasoning as we did in Problems 2.21 and 2.22 to see that the period of both sinax and 


COs ax is A Al , where a is a nonzero constant. 


Problem 2.23: Is f(x) = sin = + cos 2x periodic? If so, what is the period? 


Solution for Problem 2.23: We'll start with the graph of y = sin ¥ + cos 2x, and check if the graph repeats itself 
regularly. Of course, you wouldn’t want to build such a graph by hand! You’d use a computer, just like we did to 
produce the graph below: 


Figure 2.15: Graphs of y = sin 3 * + cos 2x (solid), y = sin $ z (dashed), and y = cos 2x (dotted) 


The solid graph appears to peak at x = —5, x = —m, and x = 3n. Each peak is 47 after the previous peak. The 
pattern of the solid curve from x = —5z to x = -r appears to be exactly the same as the curve from x = —7 to 
x = 37, and the pattern appears to continue after x = 37. It looks like the graph repeats itself, so we suspect the 
function f(x) = sin + cos 2x is periodic. But how can we be sure? And if it is periodic, why is it periodic? 


The large peaks give us one hint: these appear to occur where both sin # and cos 2x equal 1. So, we investigate 
sin 3 and cos 2x wo with special attention on what values of x make either equal to 1. Both equal 1 at 
x= a ae i increases, sin ¥ again equals 1 after is has gone through a full Loe One period of sin = oe oa. 


27/(3/2) = 7 so sin # ne at =e E 1 = $,and again at x = 3 x44 a =% , and again at x = Sn = 37. 


Moseo ils, cos 2x has period (27)/2 = 7, so cos2x equals 1 at x = 0, m, 27, 37, and so on. Indeed, both 
sin % and cos2x equal 1 again at 37, and no we have an explanation for why they will both equal 1 again at 
x = 3n + 4n = 7n. As x increases by 4r, sin % goes through exactly 3 full periods, while cos 2x goes through 
exactly 4 full periods. So, increasing x by i leaves both sin *¥ and cos 2x unchanged, which means that for 
f(x) = sin = + cos x, we have f(x + 47) = f(x) for all x, so f is periodic. To see that the period is not smaller than 


4n, we note that the values of x at which f(x) = 2 occur at intervals of 47. O 


The phase shift of a function of the form f(x) = asin(bx + c), where a, b, and c are constants, is the smallest 
horizontal distance by which the graph of g(x) = asin bx must be shifted to produce the graph of f. We typically 
use positive values for the phase shift if this shift is to the right, and negative if the shift is to the left. 


58 


2.4. TRANSFORMATIONS OF TRIG FUNCTIONS 
—<—— O AT NG VE TRIS PUN LING 


| Problem 2.24: 
| (a) What is the phase shift of sin (x - x) ? 


(b) Graph y = sin (x - z), 


Solution for Problem 2.24: 


(a) As explained on page 13, the graph of y = f(x —c) is a c-unit rightward shift of the graph of y = f(x). 
Therefore, the graph of y = sin (x — z) is a 5 rightward shift of the graph of y = sin x, so the phase shift is 


m 


oe 
Another way to determine the phase shift is to compare the values of x such that sinx = 1 to those at 
which sin (x = z) = 1. We have sinx = 1 for x = 5, and at intervals of 27 thereafter. We have sin (x = z) =1 


for x = 7, and at intervals of 27 thereafter. So, the points at which sin (x = x) = 1 are Ẹ to the right of the 
points at which sinx = 1. 


(b) Since we found in part (a) that the graph of y = sin (x- z) is > rightward shift of sinx, we graph 


y = sin (x = z) by shifting the graph of y = sin x by § to the right, as shown below: 


Figure 2.16: Graphs of y = sin (x = z) (solid) and y = sin x (dashed) 


O 


‘Problem 2.25: Graph y = sin x + sin(x — 7). What does the graph suggest about the relationship between sin x 
and sin(x — 7)? 


Solution for Problem 2.25: The graph of y = sin(x — 71) is a m-unit rightward shift of the graph of y = sin x. We graph 
y =sinx and y = sin(x — 7) below: 


Figure 2.17: Graphs of y = sin(x — n) (solid) and y = sin x (dashed) 


The graph of y = sin(x—7) appears to be the reflection of the graph of y = sin x over the x-axis. When we add the 
y-coordinates of the points on the graphs for each value of x, we always get 0, so the graph of y = sin x + sin(x — 7) 
is just the x-axis. This suggests that we have sinx + sin(x — 7) = 0 for all values of x. That is, we appear to have 
another identity: sin(x — 7) = — sin x. We will learn how to prove this identity algebraically in Chapter 3. D 
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Problem 2.26: 
(a) What is the phase shift of —2 sin (3x = x) 2 


(b) What is the period of —2 sin (3x - z)? 


(c) Graph y= -2sin (3x = z). 


Solution for Problem 2.26: 


(a) What's wong with this solup on 


The T of y = a sin (3x = 7) 3 is a 4 rightward shift of the graph of 


Boei Solutibn: 
: v= = —2 sin ee so the a a BA 


That there’s a mistake here is clear when we consider where the ape of y = —2sin (3x = z) and 
y = —2sin3x cross the x-axis. Since sin0 = 0, the graph of y = —2sin3x crosses the x-axis at x = 0, 
while the graph of y = —2sin (3x = z) crosses the x-axis where 3x — | = 0, which is at x = 7. Similarly, 
each x-intercept of the graph of y = —2sin (3x = z) is 75 units to the right of an x-intercept of the graph 
of y = —2sin3x. So, this suggests that the graph of y = —2sin (3x = 7 is a 75 unit rightward shift of 
y = —2sin 3x. 

The heart of the error in the Bogus Solution is incorrectly applying the lessons of page 13 regarding 
horizontal shifts of graphs of functions. We learned there that the graph of y = f(x —c) is a c-unit rightward 
shift of the graph of y = f(x). However, if we let f(x) = —2sin3x, then f (x = z) = —2sin (3x = an), which 


is not the same as —2 sin (3x — z). Instead, we note that 


-2an (21-7) -250 6f- 5) -76-5 


so the graph of y = —2sin (3x = z) is indeed a 75 rightward shift of the graph of y = —2 sin 3x, which means 
that the phase shift is 75. 


(b) As explained in the previous part, the graph of y = —2sin (ex = z) is a rightward shift of the graph of 


y = —2sin3x. Translating a sine curve has no effect on its period, so the period of —2 sin (3x = z) is the 
same as the period of —2sin3x. Similarly, the coefficient —2 in —2 sin 3x has no impact on the period; it 
merely scales the graph vertically, which doesn’t affect how frequently the graph repeats itself horizontally. 
So, the period of —2sin3x is the same as the period of sin 3x, which is 27/3. Therefore, the period of 
—2sin (3x — z) i is 27/3. 


(c) We can retrace our steps in the previous two parts to produce the graph. We start with the graph of 
y = sin 3x, which is the graph of y = sin x compressed horizontally towards the y-axis by a factor of 3: 
1 


Figure 2.18: Graph y = sin 3x 
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The graph of y = 2 sin 3x is the result of scaling the graph of y = sin 3x vertically away from the x-axis bya 
factor of 2. The graph of y = -2 sin 3x is the result of reflecting the graph of y = 2 sin3x over the x-axis (see 
page 14 if you don’t see why), which gives us the graph below: 

y 


Figure 2.19: Graph y = -2 sin 3x 


Finally, as explained in part (a), the graph of y = -2 sin (3x = z) is a 7 rightward shift of the graph of 
y = -2 sin 3x: 


Figure 2.20: Graph y = -2 sin (3x = z) 


Sidenote: One characteristic of trig functions that is used in physics is frequency. We often 
| use frequency in conjunction with trigonometric functions of time. For example, 
we might have the function A(t) = 3 sin(127#), where t is in seconds. The frequency 
of this function is the number of periods the function completes in 1 second—that : 
is, the frequency tells us how frequently the function repeats. See if you can figure 
out why this definition is equivalent to defining frequency as the reciprocal of the 


| period. 


One of the reasons we study complicated-looking trigonometric functions like f(x) = 3 sin(4x + 7) is that many 
processes in nature can be modeled with functions like these. You'll see such functions throughout your study of 
physics. For example, when you hear someone talk about frequency or wavelength in conjunction with sound, 
light, or even rays from outer space, they're describing a characteristic of a trigonometric function that can be 
used to model the sound, the light, or the space ray. 


We say that f(x) varies sinusoidally with a variable x, and that f is sinusoidal, if we can express f as 
f(x) = asin(bx + c) + d for some constants a, b, c, and d. 


61 


CHAPTER 2. INTRODUCTION TO TRIGONOMETRIC FUNCTIONS 


Problem 2.27: A Ferris wheel is a ride that consists of seats connected to the circumference of a large vertical 
wheel. As the wheel rotates, the riders are moved from the bottom of the wheel to the top of the wheel, and 
then back to the bottom. When you first get on a particular Ferris wheel, you are at the bottom of the ride, 3 
feet off the ground. Once the Ferris wheel starts, it rotates around its center at a constant rate. Suppose that it 
takes you 8 seconds to reach the highest point of the ride, 63 feet off the ground. 
(a) Youstart at the bottom of the ride. How high off the ground will you be 5 seconds after the ride starts? 
(Answer to the nearest tenth of a foot.) 


(b) Find a function f such that f(t) equals how high in feet you will be off the ground ¢ seconds later. 


Solution for Problem 2.27: 


(a) In the diagram at right, you start at point B, and point C is the center 
of the Ferris wheel. After the Ferris wheel rotates an angle @, you are 
at point D, as shown. Pretend for a moment that the Ferris wheel is the 
unit circle. Point B is directly below point C, so its coordinates would be 

3 

F. 

would be (cos (3 + 9) „sin (3 d a 

Next, we note that the Ferris wheel has radius 30 feet, since the distance 
from the bottom to the top of the wheel is 60 feet. So, CB = 30. Therefore, 


if we think of the Ferris wheel as a circle centered at the origin with radius 


30, rather than radius 1, then the coordinates of B are (30 cos ¥,30 sin ar) and the coordinates of D are 


(cos sin ar), Since point D is 0 counterclockwise from B, its coordinates 


Ground 


(30 cos (3z + o) ,30sin (= + a) ) . The y-coordinate of D tells us how far you are above (when it is positive) 
or below (when it is negative) the center of the Ferris wheel after the wheel has turned an angle @. Therefore, 
since the center of the Ferris wheel is 33 feet off the ground, your distance in feet from the ground after the 
wheel has turned an angle @ is given by the function 


g(9) = 30 sin (= + 0) +33. 


So, we just have to figure out the angle @ after the wheel has turned for 5 seconds. The wheel rotates an 
angle 7 in 8 seconds, so in 5 seconds, it rotates by 57/8. Therefore, your distance from the ground after 5 


seconds is 
5m 3m 57 


: eon CEE 

g (=) = 30sin (= +P =") + 33 = 30 sin —— + 33 ~ 44.5 feet. 

(b) We convert our function in terms of the angle 0 into a function in terms of the time t by finding the 
relationship between 0 and t. Since the wheel turns an angle of z in 8 seconds, in ¢ seconds it turns an angle 
0 = in. Therefore, the desired function is 


tn 3n 
H= i — + — ; 
fb 30sin (2 + 2) +33 
This function looks a lot like the others we have studied in this section. We can confirm our answer by 
using our understanding of functions of the form f(t) = asin(bt + c) + d. 


First, we note that the a sin(bt + c) term gives the vertical displacement between you and the center of 
the Ferris wheel (a sin(bt + c) is positive if you are above the center, and negative if you are below it). Your 
height from the ground is the sum of this displacement and the distance from the center of Ferris wheel to 
the ground, which is 33 feet. Therefore, we must have d = 33. 


Next, because you go from 30 feet below the center to 30 feet above the center, we know. that a sin(bt + c) 
varies from —30 to 30. That is, the amplitude of a sin(bt + c) is 30, so we can let a = 30. 
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The Ferris wheel makes a full rotation in 16 seconds, so the period of a sin(bt + c) must be 16 (assuming t 
is in seconds). Therefore, b must be (27)/16, which equals 7/8. Finally, we must account for the phase shift. 
You are at the bottom of the Ferris wheel at time t = 0, so asin(bt + c) must be as small as possible when 
t = 0. This means we can let c = 37/2, since sin(37/2) gives us the smallest possible value of sine. Putting 
all these observations together gives us 


f(t) = 30sin (= =) T33, 


which matches the function found earlier. 


g 


ica > 


2.4.1 Show that the amplitude of f(x) = asin x is Jal. 


2.4.2 Graph y =3 + sinx. 
2.4.3 
(a) Graph y = cos 2x. 
(b) Graph y = -3 cos 2x. 
(c)* Graph y = -3 cos (2x = z). 


2.4.4 Ted chooses two numbers t and u, and then graphs the function g(x) = tsin ux. The resulting graph is 
shown below. If t is positive, find t and u. 


AAA 


Figure 2.21: Graph for Exercise 2.4.4 


nja 


2.4.5 Find the period, amplitude, and phase shift of f (x) = -3 cos(2x + T) + 4. 


2.4.6 A block is attached to a long spring that is connected to the ceiling. The block is pulled away from the 
ceiling and then released, so that it bounces up and down, as the spring stretches and contracts, in a way that 
makes the distance from the block to the ceiling vary sinusoidally with time. Suppose the distance between the 
block and the ceiling varies between 1.2 meters and 1.8 meters, and that it takes 1 second for the block to go from 
high point to low point. How far from the ceiling is the block 2.7 seconds after it is at a high point? 


a ND 
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2.4.7 Benny graphs a function of the form 
f(x) = cos(ax) 


for some positive number a. Joon graphs a function of the form 
f(x) = sin(bx + c) 


for some positive number b and real number c. They produce the same graph, which is shown below. 


Figure 2.22: Graph for Exercise 2.4.7 


(a) What is a? 
(b) What is b? 
(c) Find three possible values of c. 


2.4.8 Is the function f(x) = 2sin x — tan x periodic? If so, what is the period? 


2.5 Inverse Trig Functions 


So far in this chapter, we’ve mostly concerned ourselves with finding the output of trigonometric functions given 
certain inputs. In this section, we turn this around and try to find the inputs we must give to trig functions to 
produce certain desired outputs. 


J) Problems iis 


Problem 2.28: A cart is attached to a spring that is connected to a wall. The cart is pulled 8 feet away from the 
wall and then released, so that it rolls back and forth towards and away from the wall as the spring stretches and 
contracts. The distance from the cart to the wall varies sinusoidally with time. Suppose the distance between 
the cart and the wall varies between 4 feet and 8 feet, and that it takes 6 seconds for the cart to go from the point 
closest to the wall to the point that is farthest from the wall. 


(a) Find a function that describes how far the cart is from the wall t seconds after it is released. 


_ (b) How many seconds pass before the cart is exactly 5 feet from the wall for the first time? 


(c) Find all values of t such that the cart is 5 feet from the wall after t seconds. 


Problem 2.29: 
(a) Explain why the function f(x) = cos x does not have an inverse. 


(b) Suppose we define a function g(x) = cos x and restrict the domain of g to an interval [a,b], where a and b 
are constants, such that g does have an inverse. What are valid choices for a and b? What are good choices 


for a and b? 


2.5. INVERSE TRIG FUNCTIONS 


Problem 2.28: A cart is attached to a spring that is connected to a wall. The cart is pulled 8 feet away from the 
wall and then released, so that it rolls back and forth towards and away from the wall as the spring stretches 
and contracts. The distance from the cart to the wall varies sinusoidally with time. Suppose the distance 
between the cart and the wall varies between 4 feet and 8 feet, and that it takes 6 seconds for the cart to go 
from the point closest to the wall to the point that is farthest from the wall. 


(a) Find a function that describes how far the cart is from the wall t seconds after it is released. 
(b) How many seconds pass before the cart is exactly 5 feet from the wall for the first time? 


(c) Find all values of t such that the cart is 5 feet from the wall after t seconds. 


Solution for Problem 2.28: 


(a) Because the distance between the cart and the wall varies sinusoidally with time, this distance is 
f(t) = asin(bt + c) +d, 


where a, b, c, and d are constants. The middle point of the cart’s path is 6 feet from the wall, and it varies 
2 feet from this point in both directions. So, the amplitude of asin(bt + c) is 2, and f(t) must equal 6 when 
asin(bt + c) is 0 (when the cart is at the midpoint of its path). Therefore, the function is f(t) = 2 sin(bt +c) + 6 
for some values of b and c. 


The cart starts 8 feet away from the wall, so we must have f(0) = 8. Therefore, we must have sin(bit+c) = 1 
when t = 0, which means sinc = 1. This means we can let c = 7, and we have 


f(t) =2sin (ut i =) tE 


(Note that we could let c = $, or —3#, or any number that is a multiple of 27 from 3.) 

Finally, we turn to the last piece of information we have: the cart goes from the farthest point from the 
wall to the closest point in 6 seconds. This means that it goes from the farthest point to the closest and 
back to the farthest again in 12 seconds, so its period is 12 seconds. Since the period of f(t) is 2% when b is 


positive, we must have b = x /6, and we have 


f(t) =2sin Ge 5) +4. 


(b) We seek the smallest time t such that we have f(t) = 5, so we have 
o/m n * 
2sin (Zt) +4 5, 


Isolating the sine function, we have 
ee ed 
sin| 7t- 5) = 5° 
We now need to find an angle whose sine equals to 5. Fortunately, we've seen sin 0 = 5 enough times by 
now to remember that sin Ẹ = 7. Therefore, if 2 - 3 = %, then the equation above is satisfied. Solving 
for t gives t = 4. We still have to show that no smaller t is possible. The cart goes from 8 feet from the 
wall to 4 feet from the wall during the first 6 seconds, and the distance between the cart and the wall is 
strictly decreasing during that time. This means the cart can only be 5 feet from the wall once in those first 
6 seconds. Since the cart is 5 feet from the wall when t = 4, this is smallest possible positive value of t. 


(c) What's wrong with this solution: 


S mm I Immm ammm 
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Bogus Solution: The period of sine is 271, so the value of sin (z igo 7) ee every time 


the argument increases by 27. Since we have sin (z a z) = + when 

À Zt- 3% = EZ, we musthave Zt — 3 = 4 +k(27) for some A integer. 
k. (No going backwards in time! We must have k > 0.) Therefore, the 
times at which the cart is 5 feet from the wall are £ = 4 + 12k, for any 
nonnegative integer k. . 


Wes see hee error in this reasoning by stepping reek cnt ie F nate TARE T ae tet 


a that you overlook what the problem i is about. Keeping the original problemi 
in mind can help you catch errors in your thinking. 


WARNING!! When using math to model a problem, don’t get so wrapped up in the math 


The c cart is 35 ga from the wall twice during each period—once as it approaches the wall and once as it 
moves away from the wall. The Bogus Solution only includes the times that are one full period from the 
first time the cart is 5 feet from the wall, which are the times during which the cart is approaching the wall. 
(We also could have caught this anai by thinking about the unit circle—there are two points on H unit 
circle with y-coordinate equal to 4, so there are two angles 0 in each period of sine such that sin 0 = 3.) 


Here are a couple ways to finish the problem: 


Solution 1: Use A Oa We can use the unit circle to find the other 
set of angles whose sine is 4. In the a point P is the second quadrant p 
point on the unit circle with y-coordinate }. We draw altitude PA as shown; 
the length of this altitude equals the y-coordinate of P, so PA = 3. Since leg (Lose 
PA of right triangle PAO has length } the hypotenuse of PAO, the triangle is A 
a 30-60-90 nae triangle with ZPOA = 30° = Z. Therefore, P is the terminal 
point of x — Z = %. 
So, in nes to the values of t that we found in the Bogus Solution, we 
also have sin (z: = z) = 5 when Zt- % = % + (27)k, for any nonnegative 
integer k. Solving for t gives t = 8 + 12k, where k is a nonnegative integer. Combining this with the earlier 


values we found, the cart is 5 feet from the wall when t = 4 + 12k seconds or t = 8 + 12k seconds, for any 
nonnegative integer k. 


Solution 2: Use Symmetry. Once again, we help ourselves by thinking about the original problem rather 
than staying focused on our equations. The cart goes from 8 feet away to 5 feet away in 4 seconds. Since 
it takes 6 seconds to go from 8 feet away to 4 feet away, it will take 6 — 4 = 2 more seconds for the cart to 
complete its first trip from farthest point to nearest point. By symmetry, it will then take 2 more seconds to 
return to the point 5 feet from the wall, so 4 + 2 + 2 = 8 seconds after the cart is released, it will again be 5 
feet from the wall. 


Since the period of the cart is 12 seconds and the two times in the first period at which the cart is 5 feet 
from the wall are at t = 4 seconds and t = 8 seconds, the values of t for which the cart are 5 feet from the 
wall are t = 4 + 12k seconds or t = 8 + 12k seconds, for any nonnegative integer k. 


Problem 2.29: 


(a) 
(b) 


Explain why the function f(x) = cos x does not have an inverse. 


Suppose we define a function g(x) = cos x and restrict the domain of g to an interval [a,b], where a and $ 
b are constants, such that g does have an inverse. What are valid choices for a and b? What are good f 


choices for a and b? 


2.5. INVERSE TRIG FUNCTIONS 


Solution for Problem 2.29: 


(a) A function is invertible if and only if there is exactly one input that produces each possible output from 
the function. Since cos0 = 1 and cos2n = 1, the function f(x) = cosx has two inputs that produce the 
same output, so it is not invertible. Moreover, since cos x is periodic, all possible outputs of f(x) occur for 
infinitely many inputs. 

(b) Having noted that cos x is periodic, we might think that if we restrict the domain of g to a single period 
of cos x, then we will be able to invert it. But this fails, too, because there are many values of y such that 
y = cos x for two different values of x in the same period. For example, if we consider the period consisting 


of the interval [0, 272), we have cos 5 = cos an and cos 5 = cos m, and so on. So, we can’t take a full period 
as the domain of g. 


If we take a look at the graph of y = cos x, we can quickly see what sorts of intervals we can take as the 
domain of g such that g is invertible. 


Figure 2.23: Graph of y = cos x 


If a function is invertible, then no two points on its graph can have the same y-coordinate. So, we can 
choose a suitable domain such that g is invertible by picking an interval such that the graph of y = cosx 
has no two points with the same y-coordinate when x is in the interval. 


Now, we have some choices to make—there are infinitely many ways we can choose this interval. We 


could choose [o, z] , for example. But this wouldn't be such a useful function. We couldn't use it for cases in 


which cosine is negative, for example. We'd like to choose an interval that provides all the possible outputs 
of cosx. In other words, we'd like the range of g to be [-1, 1], because then it is the same as the range of 
cos x when there are no restrictions on the domain. Looking at the graph of y = cos x, or thinking about the 
unit circle, we see that the interval [0, 77] fits the bill nicely, since cos x takes on every possible value of cos x 
once and exactly once for x € [0, 77]. 


The interval [0,7] isn’t the only possible domain for g such that g is invertible and has range [-1, 1]. 
We could have chosen [7,27], or even a union of intervals such as lo, z] U [3r, m), where g7! (x) is in the 
interval [o, z] if x is nonnegative and g™! (x) is in [37, Zm) if x is negative. 

Any of these choices is perfectly fine for the domain of g. By convention, when we wish to have a 
version of cosine that is invertible, we restrict the domain to [0, n]. As we’ve seen, this allows us to find a 
unique input for any possible output of cosine. Moreover, [0,7] is a single interval rather than a disjoint 
combination of intervals, and it’s the easiest single interval to remember, perhaps because it’s “natural” to 
start from 0 and go up. 


o 


We have a special family of functions that return an angle given the value of a trigonometric function of that 
angle. These are the inverse trigonometric functions, and they are the inverses of the trigonometric functions 
when the domains of these functions are suitably restricted to make the resulting function invertible. 


For example, we define the function f(x) = arccos x to output the angle in the interval [0, 7] whose cosine is x. 
The domain of arccos x is [—1, 1], since the cosine of an angle must be in this interval. As described in Problem 2.29, 
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[0, z] isn’t the only interval we could have chosen to be the range of arccos x, but it is the convention we typically 
use for the reasons explained in Problem 2.29. So, while there are infinitely many angles x such that cos x = 3, the 
value of arccos 4 is the angle in the interval [0, 7] such that cos x = $. Therefore, we have arccos poe. 

With arccosx defined in this manner, we have cos(arccosx) = x for all x in the domain of arccosx, and 
arccos(cos x) = x for 0 < x < n. Therefore, we can think of arccos x as the inverse function of cos x in that the 
functions “undo” each other, where we restrict the domain of cos x to [0, 7] in order to define arccos x. 


We call the function arccos x “arccosine” or “inverse cosine.” Sometimes this function is denoted cos™ x. 


We'll use both arccos x and cos“! x interchangeably in this section so you become used to both notations, but for 
the rest of the text we'll prefer arccos x to avoid confusion, since some might interpret cos~' x as the reciprocal of 
cos x. (We have the function sec x, which equals the reciprocal of cos x, to help us avoid this confusion.) 


There’s nothing special about cosine; we have inverse functions for each of the primary trigonometric functions. 
Just as with arccos x, we must place special restrictions on the ranges of the inverses of the other trig functions. 
Also just as with arccos x, the specific choices we make for these restrictions are matters of convention, and not the 
only choices we could have made. Below is a table of the inverse trig functions; as you read over it, think about 
why we choose the indicated ranges. 


Inverse Trig Function Range of Inverse 
CLI 


4 
R (0, 7) 
z 
2 


T 


Don't memorize all these domains and ranges right now. While working on the following problems, you'll 
learn more about why we must restrict the domains and ranges as described in the table above. Most of the 
restrictions will become obvious to you, rather than something to be memorized. 


)_Proviems lg 


Problem 2.30: Evaluate each of the following: 
(a) arcsin0 (c) arctan1 (e) arccot ¥3 


(b) cos"! 8 (d) arcsin (-¥) (f) sec! 248 


Problem 2.31: If x is nonzero and arccos x = arcsec y, then find y in terms of x. 


Problem 2.32: 
(a) Why is [-1,1] the domain of arccos x, but the domain of arctan x is R? 


_ (b) We use [0,7] as the range of arccos x. Why can’t the range be a smaller or larger interval? 


(c) Why can’t we also use [0, 7] as the range of arcsin x? 


Why must we exclude the endpoints of the interval (-4, x) from the range of arctanx? ` 
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Problem 2.33: Evaluate each of the following without a calculator: 


(a) arcsin(sin(397/5)) (b) — arcsin(sin(377/5)) 


Problem 2.34: 
(a) Evaluate sin(arccos 0.6) without a calculator. 


(b) Evaluate sin(arccos(—0.6)) without a calculator. 


Problem 2.35: Let y = arcsin x. In this problem, we find tan y in terms of x. 
(a) Write the equation y = arcsin x as an equivalent equation without an inverse trig function. 


(b) Write tan y in terms of sine and cosine. 


(c) Find cos y in terms of x. 


Find tan y in terms of x. 


Problem 2.36: Graph y = arcsin x. How is this graph related to the graph of y = sin x? 


Problem 2.37: In this problem, we find all t such that 2 cos (3t = z) aus. 
(a) Find all t such that cost = ae 
(b) Find all t such that cos (t — z) = = 


1) = V8. 


Problem 2.38: What angles ¢ satisfy sec ọ tan @ = —3? 


Problem 2.30: Evaluate each of the following: 


(c) Find all £ such that 2 cos (3t — 


(a) arcsin0 (c) arctan1 (e) arccot V3 


(b) cos! = (d) arcsin (-£) (f) sec} ae 


Solution for Problem 2.30: 


(a) Since sin0 = 0, we have arcsin0 = 0. 


r =) 0k 
(b) Since cos ¥ = 28 we have cos7! < =. 


(c) Ifx = arctan1, then sin x and cosx must be equal, so sini tan x will equal 1. The only angle in the interval 


(-3, x) for which this is the case is 7, so arctan] = | 
(d) We know that sin x equals -5 M2 oy -=2 for odd integer N of 7. Here, we want the one in the interval 
|-2 F 4 for which sine is negative, so we have arcsin (-) = =f, 


(e) If cotx = V3, then we have 9+ = V3. The presence of V3 suggests testing x = Z and x = 3. The former 


sinx 
works, so arccot V3 = ee 


If we hadn't pest of testing x = fe = v3, we have cosx = V3sinx. The 


identity sin? a +cos’ x = 1 also relates sin x Da cos x. E cosx = V3sin x into this gives Aone = ib 
so sin?x = 1. Therefore, we have sinx = +5. Since the range of arccott is (0,72) and cotx is negative for 
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Z <x < n, we must have 0 < arccot y3 < Fes This means sin x > 0, so sin x = 5 and we have x = aa as before. 


(f) We seek the angle x such that sec x = = We’re a lot more comfortable with sine and cosine than we are 
with secant, so we ‘rewrite this equation as 


i 28 


cosx 3 
Taking the reciprocal of both sides gives us 
ree eee 
2V3 2° 
and we're in familiar territory. In fact, this is now the same problem as part (b), so we have arcsec a =: 


| Problem 2.31: If x is nonzero and arccos x = arcsec y, then find y in terms of x. 


Solution for Problem 2.31: Let @ = arccos x = arcsec y, so we have cos 0 = x and sec 0 = y. Since x + 0, we can take 


the reciprocal of both sides of x = cos @ to get t = -$ = sec @ = y. Therefore, we have y = +. 0 


A key step in our solution to Problem 2.31 is assigning a variable to the angle that equals the inverse trig 
functions in the problem, and then rewriting the information in the problem in terms of trig functions instead of 
inverse trig functions. 


Concept: Some problems involving inverse trig functions can be most easily solved by 


O= rewriting the problem in terms of the corresponding trig functions. 


Now that we have a little practice working with inverse trig functions, let’s take a look at why we restrict their 
domains and ranges the way we do. 


Problem 2.32: 
(a) Why is [-1, 1] the domain of arccos x, but the domain of arctan x is R? 


(b) We use [0,7] as the range of arccos x. Why can’t the range be a smaller or larger interval? 


(c) Why can’t we also use [0,7] as the range of arcsin x? 


(d) Why must we exclude the endpoints of the interval (-3, z) from the range of arctan x? 


Solution for Problem 2.32: 


(a) The value of arccos x is the angle whose cosine is x. Since cosine cannot be larger than 1 or less than —1, we 
must restrict x in arccos x to be in the interval [—1, 1]. Meanwhile, for any real number x, there’s some angle 
whose tangent is x. Therefore, the domain of arctan x is R. 


(b) Any possible range of arccos x must include angles with terminal points (1,0) and (—1, 0) on the unit circle. 
These two points are terminal points of angles that are x apart. Moreover, as an angle increases from 0 to 7, 
the x-coordinate of its terminal point decreases from 1 to —1 and is never the same for two different angles. 
To include the angles 0 and 7, and all the angles in between, we need an interval of at least length z. To see 
why the range of arccos x cannot be an interval longer than 7, note that we cannot allow two angles with 
the same cosine in the range. Since the circumference of the unit circle covers a range of 27, any interval 
greater than 7 will cover more than half the unit circle. Such an interval will therefore include two points 
with the same x-coordinate, and these two points correspond to angles with the same cosine. 
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(c) Using [0, 7] as the range creates two problems. First, what would arcsin 0.5 be? Both sin § and sin 22 6 equal 
0.5. Second, what would arcsin(—1) be? There are no angles in the interval [0, 7] that have a T AAR sine. 


(d) Tangent is not defined for angles of 5 or —%, so we must exclude these values from the range of arctan x, 
since there is no value of x such that tan 5 = x or tan (-3) = =X. 


Problem 2.33: Evaluate each of the following w without a calculator: 


(a) arcesin(sin(397/5)) (b) — arcsin(sin(377/5)) 


Solution for Problem 2.33: 


(a) What’s wrong with this answer: 


Bogus Solution: - Since arcsin(sin x) = x, we have arcsin(sin(397/ 5)) = 397/5. 


The mistake here is that the range of arcsin x is T oy al and #2 is not in this interval. We only have 
arcsin(sinx) = x if x € [-4, |. So, we have to find the angle in te interval [-2 Se z] whose sine equals 
sin 22. 


While we haven’t learned yet how to evaluate sin 2 without a calculator, we do know how to find 


other angles that have the same sine as the sine of 22". Subtracting multiples of 2x from 2" produces other 
angles whose sine equals sin 2, since subtracting 27 from an angle does not change its ane We find that 


5 f 
an —4-(2n) = —#, so sin 2r = sin (- z). But we don’t know how to evaluate sin (-2), either. However, 


—§ is in the range a arcsin x, so whatever angle sin (-2) is, when we take the arcsine of that angle, we'll 


get -f back: 
arcsin (sin =) = arcsin (sin (-=)) = == 


(b) Ifwe ty the same tactic as in the previous part, we find that aa -3(27) = m and a —4(2n) = 2. Uh-oh, 
neither 2 nor -E is in the range of arcsinx. There isn’t an rele in eb range af arcsin x ets the same 


ena point as the angle 2. Does that mean there’s no solution? 


No! ie don’t need an angle in the range of arcsin x with the same terminal 
point as 4; we need an angle in this range with the same sine. At right we 
have the unit circle, and P is the terminal point of 22. There is another point, 
Q, on the unit circle with the same y-coordinate as P. This point is the terminal 
point of an angle whose sine equals sin %7. Congruent right a arc 
and AQOB give us ZQOB = ZPOA. Since P is the terminal point of - F, we 
have ZPOB = 3m, so LFPOA = Te g = © which means ZQOB = an ree ae 


Q is the terminal point of —-2, and we have 


in (sin “FE ) = arcsin (sin (~~ arcsin (sin (-"F)) = 
arcsin (sin = arcsin | sin 5 = 5 = 5 : 


Problem 2.34: 
(a) Evaluate sin(arccos 0.6) without a calculator. 


(b) Evaluate sin(arccos(—0.6)) without a calculator. 
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Solution for Problem 2.34: 


(a) 


(b) 


If we could find the angle 6 with 0 < 0 < n such that cos 0 = 0.6, then our problem would become finding 
sin 0. But without a calculator, it’s not clear how to find the angle whose cosine is 0.6. 


While we can’t evaluate arccos 0.6, we do at least have another way to view the problem. Instead of 
“Evaluate sin(arccos 0.6),” we can view the problem as 


We have cos 0 = 0.6, where 0 < 0 < n. Find sin @. 


This at least has the advantage of getting rid of the confusing inverse trig function, and leaves us with 
regular trig functions, with which we’re more familiar. 


Concept: _ Rewriting problems involving i inverse trig functions as problems involving only | 


O= regular trig functions c can sometimes make the solution easier to find. 


Our problem now is to find sin 0 given cos 0, so we look for a way to relate the two. Here are two ways 
to do so: 

Method 1: The Unit Circle. Since 0 = arccos 0.6, it is a first quadrant angle, 
as shown in the diagram at right. The Pythagorean Theorem then gives us 


P 
AO? + PA? = PO?, so PA = VPO? - AO? = V1? -0.62 = 0.8. (We might 
also have recalled the Pythagorean triple {3,4,5}; dividing these by 5 gives 
{0.6, 0.8, 1}.) We then have sin 0 = PA = 0.8. 4 


Method 2: Use a Trig Identity. Our first approach is essentially equivalent 
to using the trig identity sin’ 0 + cos?@ = 1. Since cos@ = 0.6, we have 
sin? 0 = 1 — (0.6)? = 0.64. 

We have to be careful to avoid this mistake: 


Bogus Solution: — Taking the square root of both sides gives sin @ = +0.8, so there are two 


solutions. 


Looking back at i original problem, we aio immediately that WSL | is wrong here because 
sin(arccos x) is a function. Therefore, for any value of x in the domain of this function, there is at exactly one 
value of sin(arccos x). To determine which of 0.8 and —0.8 is the value of sin(arccos 0.6), we must remember 
that 0 = arccos0.6. This means @ is restricted to the range of arccos x, which is the interval [0,7]. Sine is 
nonnegative for all angles in this interval, so we discard the negative solution and have sin 0 = 0.8. 


We follow our steps from part (a), and recast the problem as: 
We have cos 0 = —0.6, where 0 < 0 < n. Find sin 0. 


As before, we can finish with either the unit circle or a trig identity. 
Method 1: The Unit Circle. Since cos 0 = —0.6 and @ is in the interval [0, 7], 
the angle @ is in the second quadrant, as shown. From cos 0 = —0.6, we have 
OA = 0.6. Therefore, we have PA = VPO? — OA? = 0.8, so sin@ = 0.8. 
Method 2: Trig a As in part (a), we use sin? 0 +cos* @ = 1, from which X 


we find sin? 0 = 1 — cos? 0 = 1 — 0.36 =U Since 0 < 0 < 7 and sin@ is 
nonnegative for all angles 0 in this range, sin? 0 = 0.64 gives us sin 0 = 0.8. 
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Problem 2.35: Let y = arcsin x. Find tan y in terms of x. 


Solution for Problem 2.35: We start by getting rid of the inverse trig function. We write y = arcsinx as sin y = x, 
where y € [-£, zl, Now, we must relate tan y to sin y in order to express tan y in terms of x. Unfortunately, we 


don’t have a nice neat identity like we have to relate sin y and cos y. But we can write tan y in terms of sin y and 
cos y, and we already have sin y in terms of x: 


We now just have to express cos y in terms of x. We know how to relate cos y to sin y. From sin? y + cos? y = 1, we 
have 


e y= leyl 


Once again, we face the puzzle of choosing the positive or negative root. Since y € [-z, z] (because y is in the 


range of arcsin x) and cos y is nonnegative for all y in this interval, we have cos y = V1 — x2, so 


x x 
tan y = = : 
cos y 1—x2 


After rewriting the problem without inverse trig functions, our key problem solving strategy in this solution 
is: 


When stuck on a trig problem involving tangent, cotangent, secant, and or cose- | 
cant, try rewriting the problem in terms of sine and cosine. 


Concept: 


O= 


Problem 2.36: Graph y = arcsin x. How is this graph related to the graph of y = sin x? 


Solution for Problem 2.36: The domain of arcsin x is [—1, 1] and the range is |- S z] , SO the entire graph is contained 
S 


x increases from —1 to 1, arcsin x 
increases from —5 to 5. 


Figure 2.24: Graph of y = arcsinx 


The shape of this graph shouldn’t be a surprise; on page 23, we explained that the graph of a function’s inverse 
is the reflection of the function’s graph over the graph of y = x. We see this relationship when we graph y = sin x 
for —% < x < § and graph y = arcsin x on the same Cartesian plane: 
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Figure 2.25: Graphs of y = arcsin x (solid) and a portion of y = sin x (dashed) 


Now that we know how to handle inverse trigonometric functions, let’s tackle a couple equations in which we 
must “undo” trig functions much like we do when we evaluate inverse trig functions. 


Problem 2.37: Find all t such that 2 cos (3t — z) = V3. 


Solution for Problem 2.37: The cosine expression looks pretty complicated. Let’s try to get a handle on the problem 
by solving a simpler version of the problem first. 


Concept: Solving a simpler version of a complicated problem can help solve the original 


O= complex problem. 


Since it’s the complexity of the trig function that is the difficulty here, let’s replace the left side with a simpler 
trig function, like cos x. This gives us 
cosx = V3. 
Uh-oh. That equation has no solutions. Did we change the problem too much, or does the original equation have 


no solutions? 


Looking back at the original equation, 2 cos (3t = z) = V3, it’s the (3t = z) that makes the cosine complicated, 
not the coefficient 2. We can divide both sides by 2 to isolate the cosine: 
m\ _ VB 
cos (st = =) 
Concept: In equations involving trig functions, try to isolate trig functions by manipulating 
Q= the equation so that you have a trig function equal to a constant. 


Now, our “simpler version” of the problem makes more sense: 


cosx = 
~ 
As in Problem 2.29, there are infinitely many solutions to this equation. We have cos x = 2 for x = %, for x = UZ, 


and for any angle that is a multiple of 27 more or less than either of these angles. We can write these solutions as 
x = Z + 2kn or x = 4 + 2kn, for any integer k. 


But how does this help us with 
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The only difference between this and cos x = 28 is that we've replaced x with (3t = z) . So, to solve our complicated 


problem, we replace x with (3t = z) in our solution to the simpler problem. Since “x = Z + 2kn or x = 42 + 2kn, 


for any integer k,” are the solutions to cos x = ae the solutions to cos (3t = z) = = are the values of t such that 


moo m m lln 
3t- — = — +2k —-— = — 
nc T or 3t 7 A +2kr, 


for any integer k. Isolating t in each of these gives us the solutions 


on 2kr p- 20m 2kn 


t 


= + pasha 
es e T 
for any integer k. 


As an Exercise, you'll explore the fact that there are other forms in which we can express these solutions. 0 


| Problem 2.38: What angles ¢ satisfy sec @ tan = —3? j 


Solution for Problem 2.38: This problem is more complicated than the previous problem for a couple reasons. 
First, it’s not immediately possible to rewrite the equation as a trig function equal to a constant. Second, the trig 
functions are secant and tangent, which are less familiar than sine and cosine. We’ll tackle the the second difficulty 
first, by rewriting the problem in terms of sine and cosine: 


1 sind 2 


cosh cosp 3 


Multiplying both sides by 3 cos’ ¢ to get rid of the fractions gives 
3sin@ = —2cos* ¢. 


Now what? 


Concept: Trig equations witha single type of trig function are often easier to deal with than 
Q= equations with two different types of trig functions. 


ar 


We can write 3sin@ = —2cos? @ in terms of just sind by noting that cos*¢ = 1 — sin? @, so our equation is 
3sin@ = —2(1 — sin’ p). Rearranging this equation gives 


2sin? @-3sind —2=0. 


This is a quadratic equation in sin @. (If you don’t see why, let y = sin @, which makes the equation 2y*-3y-2 = 0.) 
Factoring this equation gives 

(2sinġ + 1)(sing — 2) = 0. 
Therefore, we must have 2sin@ + 1 = 0, which gives sin ọ = -5, or sing — 2 = 0, which gives sin ọ = 2. We have 
succeeded in manipulating the equation into a single trig function set equal to a constant. 


We can discard sin @ = 2, since sine cannot be more than 1. To solve sing = -}, we first find the two solutions 
with 0 < @ < 2r, which correspond to the two points on the unit circle with y-coordinate —}. These are @ = % 
and @ = 42. Checking these confirms that both do satisfy the original equation. We can add any multiple of 27 
to these angles, so our full solution is @ = m + 2kn or ġ = 2 + 2kn for any integer k. 0 
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[Exercises EE 


2.5.1 Evaluate each of the following: 


(a) arcsin0.5 (c) tan7!(-1) (e) arccot 8 
(b) arccos (-2) (d) arccos0 (f)  csc™}(-—2) 


2.5.2 Evaluate cos(arcsin 0.3) without using a calculator. 
2.5.3 Evaluate arctan (tan s), 


2.5.4 Why do we choose (-4, x) as the range of arctan x instead of using [0, 72) as the range? 


2.5.5 Find all values of x for which sin(arccos x) < 0. Are there any values of x for which we have cos(arcsin x) < 0? 
2.5.6 Lety = arctan x. Find sin y in terms of x. 
2.5.7 Kira solved Problem 2.37 and produced the solutions 


_ On , 2kn r p22% 
Bese x a oe 


for any integer k. The second set of solutions appears to be subtly different than our solution to Problem 2.37. Did 
Kira make a mistake? 


2.6 Summary 


The unit circle is the circle with radius 1 centered at the origin of the Cartesian plane. The terminal point of an 
angle @ is the point that results when the point (1,0) is rotated counterclockwise by 8 about the origin. 


Definition: The cosine of an angle @ is the x-coordinate of the terminal point of 6. We denote the cosine of 
8 as cos 0. 
The sine of an angle @ is the y-coordinate of the terminal point of 8. We denote the sine of 0 as sin 8. 


The tangent of an angle @ is the ratio of the y-coordinate to the x-coordinate of the terminal point of 0. We 
denote the tangent of 0 as tan @. 


When cos 0 + 0, we can express tan 8 as sing If cos 6 = 0, then tan 8 is undefined. 


The graphs of sine, cosine, and tangent are below. 


y 


Figure 2.26: Graph of y = sinx 
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Figure 2.27: Graph of y = cos x 
y 


Figure 2.28: Graph of y = tan x 


Three other trigonometric functions are secant, cosecant, and cotangent, which are denoted as sec, csc, and 
cot, respectively. These are defined in terms of cosine, sine, and tangent as follows: 


1 
0 = j 
aaa cos 0 
1 
0 = y 
ite sin 0 
coté = 1 S a 


A trigonometric identity is an equation that is true for all angles for which the equation is defined. In this 
chapter, we proved the following identities: 


sin? 0 + cos? 0 = 1, 
cos(—9) = cos 0, 
sin(—0) = -sin 8. 


We also showed that —1 < sin 8 < 1 and —1 < cos 8 < 1 for all 0. 
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Degrees are not the only way we can measure angles. We can also use radians. To B A 
measure an angle in radians, we draw a circle with radius 1 centered at the vertex of the 
angle. The measure of the angle in radians equals the length of the arc cut off by the angle. 


So, in the diagram at right the measure of @ in radians equals the length of arc AB. 


We say that f(x) varies sinusoidally with a variable x, and that f is sinusoidal, if we 
can express f as f(x) = asin(bx + c) + d for some constants a, b, c, and d. 


A function f is called periodic if there is some nonzero constant t such that f(x) = f(x+#) 
for all values of x such that f(x) and f(x + t) are defined. The smallest possible positive value of t is called the 
period of f (if such a smallest value exists). The period of sin ax and of cos ax is Tae while the period of tan ax is 77. 


The amplitude of a trig function is half the difference between the maximum and the minimum value of the 
function. The phase shift of a function of the form f(x) = asin(bx+c), where a, b, and c are constants, is the smallest 
horizontal distance by which the graph of g(x) = asin bx must be shifted to produce the graph of f. Typically, a 
positive phase shift indicates a rightward shift and a negative phase shift indicates a leftward shift. 


By restricting the domains of the trig functions, we can define inverses of them. Below is a table of the inverse 
trig functions, their domains, and their ranges. 


Inverse Trig Function | Domain of Inverse | Range of Inverse 
arccos x [-4, 1] [0,7] 


[-1,1] [-3, 3 
R 


Things To Watch Out For! 


e InSection 1.3, we noted that f*(x) usually means f(f(x)). Trigonometric 
functions are a common exception to this. When we write sin’ x, we 
mean (sin x)’, not sin(sin x). 


WARNING! 
os 


It’s very easy to confuse the definitions of secx and cscx. Don’t be 
afraid to look them up when you encounter secant and cosecant. 


If an angle measure does not include the degree symbol or any indica- 
tion of units, then the measure is assumed to be in radians, not degrees. 
So, if you are asked to find sin 30, the answer is not 1/2, since sin30 
refers to the sine of an angle of 30 radians, not 30 degrees. 


When using a calculator or computer, make sure you know whether 
your device or program is using degrees or radians. If it assumes that 
the angle is in radians, and you enter “tan(30)” to find tan 30°, the 
answer will not be what you intend. One quick way to test it is to use 
it to compute sin 30°. If it returns something like —0.98803 instead of 
0.5, then it is expecting the angles in radians, not degrees. 


REVIEW PROBLEMS 


Problem Solving Strategies 


Graphs can help provide intuition for problems involving trig functions. 


Some problems involving inverse trig functions can be most easily solved 
by rewriting the problem in terms of the corresponding trig functions. 


When stuck on a trig problem involving tangent, cotangent, secant, and or 
cosecant, try rewriting the problem in terms of sine and cosine. 


Solving a simpler version of a complicated problem can help solve the 
original complex problem. 


In equations involving trig functions, try to isolate trig functions by manip- | 
ulating the equation so that you have a trig function equal to a constant. 


Trig equations with a single type of trig function are often easier to deal with. 
than equations with two different types of trig functions. 


I) REVIEW PROBLEMS 


2.39 Evaluate each of the following: (As an extra challenge, try finding them in your head, without writing 
anything down.) 


(a) cos150° (c) csc 90° (e) cos315° 
(b) tan30° (d) tan270° (f)  sin225° 


2.40 Evaluate each of the following: (As an extra challenge, try finding them in your head, without writing 
anything down.) 


(a) sin(—120°) (c) tan 1320° 
(b) cos(—540°) (d) sec 660° 
2.41 For how many values of O such that 0 < 0 < 77 is cos @ = —0.1? 
2.42 What are the domain and the range of the function f(t) = 2 tan(3t — 7)? 
2.43 Convert each of the following to radians: 
(a) 144° (b) 336° 
2.44 Convert each of the following measures in radians to degrees: 
Glas (b) 


2.45 Evaluate each of the following: (As an extra challenge, try finding these in your head, without writing 
anything down.) 


(a) cos 32 (©) eae 
(b) — sin(—7) Gece. 
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2.46 Which of the following are positive: cos 1.37, sin(—6), tan m, sec 6? 
2.47 Determine the sign of sin 3 + sin 4 without using a calculator. 
2.48 Is it possible for sin(sin x) to equal 1? 


2.49 Jenny is writing a computer program that contains the following line of code: 
xX = t*sin(y). 


This line of code sets x equal to the product of t and the sine of y. Unfortunately, y is in degrees, and the “sin” 
function in her program expects it to be in radians. How should she change this line of code to obtain the desired 
result? 


2.50 


(a) What are the asymptotes of the graph of y = cot x? 
(b) What are the asymptotes of the graph of y = cot3x? 


2.51 Michelle graphs a function of the form f(x) = asin bx, where a and b are nonzero constants, and then graphs 
g(x) = acos bx. She uses a blue pen for one graph and a red pen for the other. She gives you a 1-second glance at 
her two graphs—how can you quickly tell which color she used for the graph of f? 


2.52 Find the period, amplitude, and phase shift of g(x) = —2 sin G = =) 


2.53 What is the period of g(x) = —2 tan(3x — 7)? 


(a) Graph y = sin (z ae x) 
(b) Graph y = sin (4 -x). 
(c) What relationship between sin (z = x) and cos x is suggested by the graph in part (b)? 
2.55 Graph y = 2sin (2 = m). (Be sure to check that you have the correct graph by choosing values of x, 


finding the corresponding values of y, and confirming that your graph passes through each ordered pair (x, y) 
you produce.) 


2.56 Leah graphs the function h(x) = acos bx for some values of a and b. The resulting graph is shown below. 
Find all possible values of a and b. 


Figure 2.29: Graph for Problem 2.56 
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2.57 Evaluate each of the following: 
(a)  arccos(—0.5) (c) tan 0 
(b) arcsin(—1) (d) arccos (v3 /2) 
2.58 Evaluate sin(arctan 2) without using a calculator. 
2.59 Find all values of k such that 3 cot(4k — 7) = V3. 
2.60 Graph y = arctanx. What are the asymptotes of the graph? 
2.61 The range of arcsec x is [o, z) U (z, | . Why must we leave 5 out of the range? 


2.62 If 8tan@ = 3cos 0, where 0 < 0 < 7, then determine the value of sin @. 


Challenge Problems 


(a) How is the graph of y = sin (x = 3) related to the graph of y = cos x? 


(b) What relationship between the functions f(x) = cosx and g(x) = sin (x — z) does your graph in part (a) 
suggest? 


2.64 Steph graphs the function h(x) = sin(px + q) for some values of p and q. The resulting graph is shown below. 


Figure 2.30: Graph for Problem 2.64 


Find all possible values of p and q. Hints: 91 
2.65 What is the period of f(x) = (sin x)(cos x)? Hints: 131, 239 


2.66 How many values of x satisfy sin x = es ? Hints: 237 
2.67 Find arccos (sin $). Hints: 31, 187 

2.68 For what values of 8 is sin 30 < 0.5? Hints: 158 

2.69 Let g(x) = sinax + cos bx. 


(a) Ifaand bare integers, must g(x) be periodic? 


(b) Ifaand bare rational numbers, must g(x) be periodic? 
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2.70x While finding the sine of a certain angle, an absent-minded professor failed to notice that his calculator 
was not in the correct angular mode. He was lucky to get the right answer. What are the two smallest positive 
values of x such that the sine of x degrees equals the sine of x radians? (Source: AIME) Hints: 83, 204, 55 


2.71x The price of Up-N-Down Stock varies sinusoidally with time. At the start of this year, the price of the 
stock is $110 per share. Over the next 3 days, the price climbs to a peak of $120 per share. If the lowest price the 
stock reaches is $80 per share, how much money can you make this year (in 365 days starting from the start of this 
year) if you start with $1000 and time all of your trades perfectly? (Assume that you can buy fractions of a share 
of stock, and that you can buy stock as many times as you want, but can never sell stock you haven't already 
bought first.) Hints: 193 
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Let it be an alliance of two large, formidable natures, mutually beheld, mutually feared before yet they recognize the deep 
identity which, beneath these disparities, unites them. — Ralph Waldo Emerson 


CHAPTER 


— 


Trigonometric Identities 


A trigonometric identity is an equation involving trig functions that is true for all values of the variables for 
which the equation is defined. Back on page 38 we proved one of the most important trig identities, 


sin? 0 + cos? 0 = 1. 


Throughout Chapter 2 we hinted at several other identities, and on page 50, we used the unit circle to prove 
sin(—@) = -sin 0 and cos(—@) = cos @ for all 0. In this chapter, we derive and explore many more trig identities, 
and learn how to apply them to challenging problems. Don’t try to memorize each new identity you encounter in 
this chapter. Instead, try to understand why the identities are true. Many of them will become automatic if you 
understand their derivations. Moreover, in Chapter 7, we'll see how complex numbers can help you remember 
many of these identities. 


In this chapter, you will be asked to compute many trigonometric expressions. Do not use calculators or 
computers for these problems. The point of these problems is to learn how to apply identities to simplify 
expressions. 


Don’t bother memorizing all the identities in this chapter. We’ll let you know 
which ones are the most important, and you should make sure you understand 
those well enough that you have them memorized. The main purpose of this 
chapter is to learn how deeply related the trigonometric functions are, and how 
to use those relationships to simplify expressions and solve problems. You can | 
usually look up the more obscure identities whenever you need them. Focus 
instead on how to apply them. 


In that vein, you should know the identities sin’ @ + cos? @ = 1, cos(—@) = cos@, and sin(—@) = —sin@ well 
enough that you have them memorized. You can probably most easily internalize them by visualizing them on 
the unit circle. 
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3.1 Introduction to Trig Identities 


_Problems _ 


Problem 3.1: In this problem, we find all x with 0 < x < 27 such that tan x + sec x = 3. 
(a) Write the equation in terms of sin x and cos x. 


(b) Solve the equation for sin x in terms of cos x. 


(c) Use sin? x + cos? x = 1 to find cos x. 


(d) Find all x with 0 < x < 2n that satisfy the original equation. (Don’t forget to check your answers.) 


Problem 3.3: Show that sa aal = cotx. 
1+tan*x 


Problem 3.4: Use the unit circle to express each of the following in terms of sin 9, cos 0, or tan @: 
(a) sin(90° — 0) (b) cos(90° — 0) (c) tan(90° — 0) 


Problem 3.5: Express each of the following in terms of sin 0, cos 0, tan 9, or cot @: 
(a) cos (z + a) (c) tan(n+ 8) 
b) sin(x-@) (d) cos(0- 3) 


In each part, use the unit circle and/or graphs of trigonometric functions to explain your results. 


Problem 3.6: In this problem, we find the smallest positive angle @ such that sin 20 = cos 30. 


(a) Use an identity from this section to write cos 30 in terms of sine. (You do not have to write it in terms of 
sin 0; you should express cos 30 as the sine of some expression.) 


(b) Find an angle that satisfies your equation from part (a). 
(c) Explain why the equation does not hold for any smaller positive value of 0. 


We'll start with an example that shows one reason why we care about trigonometric identities. 


Solution for Problem 3.1: We start by writing the equation in terms of sine and cosine. 


A fruitful first step in many trig problems is rewriting the problem in terms of 
sine and cosine, because most people have more experience with sine and cosine 
than with the other trig functions. 


Concept: 


1 
Our equation now is za = V3. Multiplying both sides by cos x gives sin x + 1 = V3 cosx. Now what? 


cOSxX COSX 


Concept: | When solving an equation that contains trig functions, it’s often best to write the 
Q= equation in terms of a single trig function, so that you can then try to solve the 
equation for that function. 
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We write sinx +1 = V3 cos x in terms of a single trig function using sin? x + cos? x = 1. Since cos? x = 1 — sin? X 


we square both sides of the equation sin x + 1 = V3 cos x, so that we can substitute 1 — sin? x for the resulting cos? x. 
Squaring the equation gives 


(sin x + 1} = 3cos? x. 


Substituting cos? x = 1 — sin? x and expanding both sides gives sin? x + 2sinx + 1 = 3 — 3sin? x. Simplifying this 
equation gives 2 sin? x + sin x — 1 = 0. This equation is quadratic in sin x. Factoring gives 


(sinx + 1)(2sinx — 1) = 0 


From sinx + 1 = 0, we have sinx = -1, which gives us x = =. (Remember, we are only asked for solutions such 


that 0 < x < 27.) From 2sin x — 1 = 0, we have sin x = t, which gives us the solutions x = 7 and x = s, 


What’s wrong with this finish: 


Bogus Solution: Therefore, our solutions are x = =, 3 


gi = 


If you try each of these “solutions,” then you'll see a few problems. The value x = Ẹ does indeed satisfy the 


original equation, but when x = =, we have tany = ait and secx = —2%8, so tanx + secx = — V3. Therefore, 


37 
x= Sz j is not a solution to the an equation. Moreover, when x = 3#, both tan x and sec x are undefined. How 


did we get these false solutions? 


3n 
and oe 


WARNING!! Whenever you solve an equation by performing some possibly irreversible 
“S steps, you must check your solutions at the end, to make sure they satisfy the 


original equation. Two common irreversible steps are squaring an equation 
and multiplying both sides of an equation by an expression that might equal 
Zero. 


Here, we took both of the irreversible steps described in the warning; the first when we squared both sides 
and the second when we multiplied both sides by cosx. The first of these ultimately brought us the solution 
x = and the second brought us x = %. Since these “solutions” do not satisfy the original equation, we call 
them extraneous solutions, and they are not solutions of the original equation. Therefore, the only solution to the 


original equation is x = 7. O 


In addition to helping us solve equations, the identity sin? x + cos? x = 1 can help us derive new identities. 


Problem 3.2: Prove the identity tan? x + 1 = sec? x. 


Solution for Problem 3.2: Writing the left side in terms of sine and cosine leads us quickly to the proof: 


sin? x sin? x + cos? x 1 5 
tan*’x+1= 5 wi o =m SS oe 
cos? x cos? x cos? x 


2 


As an Exercise, you'll also prove the related identity cot? x +1 = csc? x. 


Important: tan? x + 1 = sec? x, 


WW cot x +1 = cse x. 
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csc x )(sec x 
Can = CO, 
1+ tan*x 


Problem 3.3: Show that 


Solution for Problem 3.3: We could write everything in terms of sine and cosine, but we see that we can simplify 
the denominator of the left side immediately with 1 + tan? x = sec? x. So, we do that first: 


(csex)(seex) (cscx)(secx) _ cscx _ aa, _ COsx 


1+ tan? x sec? x secx —L sin x 


Problem 3.4: Use the unit circle to express each of sin(90° — 0), cos(90° — 0), and tan(90° — 0) in terms of sin 0, 


-cos 0, tan 0, or cot 0. 


Solution for Problem 3.4: In the diagram, point P is the terminal point of the angle 0. 
To find the terminal point of 90° — 0, we start at (0, 1), which is the terminal point of 
90°, and go 0 clockwise, since @ is subtracted from 90°, not added. If @ is acute, then 
drawing altitudes from P to the x-axis and Q to the y-axis produces right triangles 
AQBO and APAO. Since QO = PO and ZQOB = ZPOA, we have AQBO = APAO. 
Therefore, PA = QB and OA = OB. 


Since P is the terminal point of 0, we have cos@ = OA and sin 0 = PA. Since Q 
is the terminal point of 90° — 0, we have cos(90° — 0) = QB and sin(90° — 0) = OB. 
Combining these with PA = QB and OA = OB from above gives us 


cos(90° — 0) = QB = PA = sin 0 


and 
sin(90° — 0) = OB = OA = cos 0. 


We appear to have two new identities, but we haven’t yet shown that these identities hold for all angles. We 
can extend our work above to other angles with some careful casework, or we can use the symmetry that our 
diagram suggests. 


The actions of rotating the point (1,0) an angle 0 counterclockwise and rotating 
(0,1) an angle @ clockwise are symmetric about the graph of y = x, which is dashed 
in the diagram at right. Therefore, the corresponding terminal points P and Q are 
symmetric about the line y = x. As we described on page 23, this means that we 
find the coordinates of Q by reversing the coordinates of P. Since P has coordinates 
(cos 6, sin 0), point Q has coordinates (sin 0, cos 0). But because Q is the terminal 
point of 90° — 0, we can also express Q as (cos(90° — @),sin(90° — @)). Since we 
can express Q both as (sin 0, cos 0) and as (cos(90° — 0), sin(90° — @)), we must have 
cos(90° — 0) = sin @ and sin(90° — 0) = cos 0. 


sin(90° — 0) = cos 0, 
cos(90° — 0) = sin 0. 


3.1. INTRODUCTION TO TRIG IDENTITIES 


tan(90° — 6) = cot 0 


We could also have discovered the relationship between cos(90° — x) and sinx by examining the graphs of 
trigonometric functions. Converting to radians, we express cos(90° — x) as cos ( 7 cy and consider the graph 
of y = cos (z = x). To produce this graph, we first create the graph of y = cos (z + x) by shifting the graph of 
y = cosx to the left by 3. 


Figure 3.1: Graph of y = cos (3 i x) 


We then produce the graph of y = cos (z = x) by reflecting the graph of y = cos (z 48 x) over the y-axis. (To see 
why, let f(x) = cos (z a x). Reflecting the graph of f over the y-axis gives us the graph of y = f(—x) = cos (z = n 


as described on page 14.) 
y 


Figure 3.2: Graph of y = cos (3 = x) 


The graph of y = cos (z = x) appears to be exactly the same as the graph of y = sinx, which suggests that 


sinx = cos (z = x) for all values of x. This graphical explanation is not a rigorous proof, but graphs can help us 
discover identities that we can rigorously prove by other means, such as our explanation with the unit circle. 0 


Problem 3.5: Express each of the following in terms of sin 6, cos 0, or tan 0: 


(a) cos (%+0) (c) tan(z +8) 
(b) sin(n- 0) (d) cos ( — sn) 


In each part, use the unit circle and/or graphs of trigonometric functions to explain your results. 


Solution for Problem 3.5: 
(a) What’s wrong with this solution: 
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Bogus Solution: The graph of y = cos (z +2) is the graph of y = cosx shifted 5 to the 
right. This graph is the same as the graph of y = sinx, so we have 
: cos (z + x) = sin x. 


At first, we might question the claim that shifting the graph of y = cosx to the right by } does indeed 
produce the graph of y = sin x. So, let’s try it: 


Figure 3.3: Graph of y = cos x (dashed) and y = cos x shifted 5 to the right (solid) 


Sure enough, the resulting solid graph above is the graph of y = sinx. Maybe the Bogus Solution is 
correct. 
Important: When you think you’ve found a new trigonometric identity, test it for several 
Vv? angles. Make sure you use angles from different quadrants in your testing—a 
proposed identity might work only in a few special cases. 


We test whether cos (z + a) always equals sin 0 by evaluating both for various values of @. Let’s try 


6 = 0. That gives us cos (z + 0) = 0 and sin@ = 0. OK, that works; let’s try 0 = 7. Then, we have 


4 
cos (z + 9) = cos Ff = att and sin 0 = A Uh-oh. Those aren’t equal; they're opposites. If we try a few 


more values of 0, it appears that cos (z + a) and sin 8 are always opposites. 
Looking more closely at our Bogus Solution reveals the error. The graph of y = cos (z + x) is the result 


of shifting the graph of y = cosx to the left by +, not to the right. (See page 13 if you don’t see why.) The 
correct graph is below: 


Figure 3.4: Graph of y = cos (z T x) 


This graph is the same as the graph of y = sin x reflected over the x-axis. Reflecting the graph of y = sin x 
over the x-axis gives us the graph of y = -sinx (see page 14). Since the graphs of y = cos (z + x) and 
y = -sin x appear to be the same, it seems like we must have cos (z + x) = —sinx for all x. 


See if you can also use the unit circle to explain this identity. 
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(b) 


(c) 


(e) 


3.1. INTRODUCTION TO TRIG IDENTITIES 


We're asked to find sin(— 6) in terms of trigonometric functions of 8. Suppose 
0 is an acute angle, and that P is its terminal point. Let Q be the terminal point 
of 7 — 0, so that the segment joining Q with the origin forms an angle of 0 
with the negative x-axis, as shown at right. Drawing altitudes from P and Q 
to the x-axis forms congruent right triangles POA and QOB, from which we 
have PA = QB. Therefore, P and Q are the same distance above the x-axis, so 
they have the same y-coordinates, which means their corresponding angles 
have the same sine. 


If 0 is obtuse (5 < 0 < 7), then we have the essentially the same diagram, 
but with P to the left of the y-axis and Q to the right of the y-axis. 


We can go through similar arguments for values of 9 greater than 7 or less 
than 0. We could also note that rotating the point (1,0) an angle of 0 degrees 
counterclockwise and rotating the point (—1, 0) an angle of 8 degrees clockwise 
are symmetric about the y-axis, as depicted at right. 


If two points are symmetric about the y-axis, then the points have opposite 
x-coordinates and equal y-coordinates. Therefore, we have cos(n—€) = — cos @ 
and sin(z — @) = sin 8. 

Back on page 50, we showed that the period of the tangent function is n. 
Therefore, we have tan(z + 0) = tan 8. 


We must find cos (8 = sn) in terms of trigonometric functions of 0. At right, 
we show a casein which 0 is a fourth quadrant angle (with terminal point P), so 
that 0 — ¥ is a first quadrant angle (with terminal point Q). We draw altitudes 
PA and QB as shown. Since P and Q are the terminal points of angles that are 
an apart, we have ZPOQ = 4, which means that /QOB = 5 — ZPOA = ZOPA. 


This tells us that AQOB = AOPA. Therefore, we have OB = PA. 
Since P and Q are the terminal points of 8 and 6 - ©, respectively, we have 
sin 0 = —PA (P is below the x-axis, so sin 0 is negative) and cos ( 0- sn) = Oj. 


Combining these with OB = PA gives us cos Q = sn) = —sin 0. 


We can also visualize this identity by graphing y = cos (x = an), which we do by shifting the graph of 


y = cos x to the right by %¥. The result is shown below: 


Figure 3.5: Graph of y = cos (x = sn) 


Just as in part (a), the resulting graph is the same as the graph of y = sin x reflected over the x-axis. We 


deduce that the graphs of y = cos (x = an) and y = ~- sin x are the same, so cos (x = sn) = —sin x for all x. 
Testing with x = % gives us cos (x - sn) = cos = = ¥ and -sinx = — (-) = £., Testing with x = z 
gives us cos (x = sn) = cos (-22) = -} and -sinx = —}. As expected, both our tests work. 0 
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Don’t bother memorizing all the possible identities we can form by shifting an angle that is some multiple 
of 90°. As we just saw, thinking about the unit circle or about graphs of trigonometric functions can help you 
rediscover these identities whenever you need them. Probably the most commonly used are the ones we showed 
in Problem 3.4 for functions of 90° — 0 and those for functions of 180° — 0: 


sin(180° — 0) = sin 0, 
cos(180° — 0) = — cos 0. 


el 


Problem 3.6: Find the smallest positive angle 0 such that sin 20 = cos 30. 


Solution for Problem 3.6: We don’t have good tools yet for expressing sin 20 and cos 36 in terms of sin 0 and cos 0. 
However, we can turn cos 36 into the sine of an expression by using the identity cos x = sin(90° — x), which turns 
sin 20 = cos36@ into 

sin 20 = sin(90° — 36). 
Now, we at least see a way to find one solution; the sines of two angles are equal if the angles are equal. Solving 


20 = 90° — 30 gives us 0 = 18°. But how do we know that this is the smallest possible positive solution? 


Consider what happens to both sides of the equation if we have 0° < O < 18°. For 0 = 18°, we found that we 
have sin 20 = cos 30, or sin 36° = cos 54°. When we decrease 0 from 18°, the value of sin 20 decreases, because as 
20 decreases from 36° (but stays positive), the terminal point of 20 gets closer to (1,0). That is, it gets closer to the 
x-axis, which means that sin 20 decreases as 20 decreases from 36° to 0°. Meanwhile, as 0 decreases from 18° to 
0°, the value of cos 30 increases, since the terminal point of 30 gets farther to the right of the y-axis as 30 decreases 
from 54° to 0°. Putting these observations together with sin 36° = cos54°, we conclude that if 0° < O < 18°, then 
we have 

sin 20 < sin36° = cos 54° < cos 30. 


Therefore, we cannot have sin 20 = cos 30 for any positive value of @ less than 18°. 5 


[| Exercises Ba 


3.1.1 Prove the identity cot? 0 + 1 = csc? 0. 
3.1.2 Prove that sin(360° — 0) = -sin 0 for all angles 0. 


3.1.3 Use the identity you proved in Problem 3.2 to solve Problem 3.1 without first converting the equation to 
sines and cosines. 


3.1.4 Find cos ( a + o) in terms of sin @ or cos 0. 


3.1.5 Find the value of the product tan (=) tan (=) tan (5) tan (35) tan (5) 


3.1.6 Find the range of 


sin A(3 cos? A + cost A + 3 sin? A + sin? A cos? A) 


ey = tan A(sec A — sin A tan A) í 


where A is not an integer multiple of 7/2. (Source: HMMT) 
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3.2. SUMS AND DIFFERENCES OF ANGLES 
3.2 Sums and Differences of Angles 


In the previous section, we found identities to simplify expressions like sin (3 + 0) and cos(@ — 180°). In each of 
the expressions we simplified, one of the angles was a multiple of 90°. In this section, we develop identities for 
trigonometric functions of the sum or difference of any two angles. 


We start by proving that 
sin(a + $) = sina cos f + sin f cosa 


for positive angles a and £ such that a + B < 90°. We can extend this proof to all angles with a lot of careful 
casework; we won't do so here. (You'll have a chance to cover some of these cases as an Exercise.) You can use 
this identity for all problems after Problem 3.7. 


[Problems i 


Problem 3.7: In this problem, we will use the diagram at right to prove that A 
if a and f are positive angles such that a + $ < 90°, then 


sin(a + f) = sin a cos f + sin f cos æ 
In the diagram, we have AD = 1, ¿BCA = a, and ZDAE = £. 
(a) Explain why AE = cos $ and DE = sin £. 
(b) Find EC in terms of sine and cosine of a and £$. 
(c) Find ZADB in terms of a and £$. 
Show that AB = sin(a + f) and sina = 48. 
(e) Show that sin(a + f) = sin a cos f + sin f cos a. 


Problem 3.8: Use the identity you found in Problem 3.7 together with identities from the previous section to 
find sin(a — $) in terms of sina, cos a, sin f, and cos f. Test your identity with a = 120° and f£ = 60°. 


Problem 3.9: In Problem 3.4, we found that sin(90° — a) = cosa. 
(a) Use sin(90° — a) = cosa together with the identity for sin(a + f) to express cos(a@ — $) in terms of sina, 
cos a, sin B and cos f. 
(b) Test your identity in the previous part by letting a = 120° and $ = 60°. If the result is not a true statement, 
then try the first part again. 


Find an identity for cos(a + p). 


(c) 


Problem 3.10: Suppose that tana, tan £, and tan(a + f) are all defined. Find tan(a + $) in terms of tana and tan $. 


Problem 3.11: 
(a) Find cos 15°. 


(b) Find sin 22. 


4 5 , ' 
Problem 3.12: Find the value of cos (arcsin 5 + arctan = without using a calculator. 
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Problem 3.13: Below is the graph of y = 3sinx + 4cosx: 


Y 


It looks like the graph of a function of the form a sin(bx + c)! Why? 


| 


Problem 3.7: Use the diagram at right to prove that if a and f are positive angles A 
such that a + f < 90°, then 


sin(a + f) = sin æ cos f + sin f cosa 


In the diagram, we have AD = 1, ZBCA = a, and ZDAE = £. 


Solution for Problem 3.7: We start by finding the lengths of various segments in the diagram in terms of sine and 
cosine of a and $. Applying the right triangle relationships we learned on page 35 to AADE, we have cos f = 45 
and sinf = 2E. We therefore have AE = cos $ and DE = sin. 


Turning to ADEC, we have tana = DE, so 


_ DE _ sinfcosa 

tana sing 
Next, we consider right triangle ADB. To find either leg of this 
triangle in terms of trig functions of a and £, we'll first have to find 
the acute angles of AADB in terms of a and f. From the angles 


in AADC, we have ZADC = 180° — DAC — ZDCA = 180° - q — $. 
Therefore, we have 


ZADB = 180° — ADC = 180° — (180° - æ - 8) =a + B. 


We can now use AABD to find an expression for sin(a + f): 


; : _ AB 
sin(a + f) = sin ZADB = AD: 3 


Since AD = 1, we have AB = sin(a + $), as shown in the diagram. 
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We're almost there! We still have one more right triangle to investigate: AABC. This triangle gives us 


tum Senos E 
AC AE+EC  cosp+ Sin feos 
so we have the equation 
sin(a + f) 


—— ~ = sina. 
cos ß a sin f cos a 


sing 


Multiplying both sides by the denominator of the left side gives: 

sin f cosa ) 
sina 

= sina cos f + sin f cosa, 


sin(a + f) = sina (cosg F 


as desired. O 


With a great deal of careful casework, we can prove that this identity holds for any angles a and £. You'll 
see some of these cases as an Exercise. We can now use this identity to prove other identities for trigonometric 
functions of sums or differences of angles. 


Concept: Once we find one trigonometric identity, we can often use it to quickly prove 


O= other related identities. 


Problem 3.8: Use the identity you found in Problem 3.7 together with identities from the previous section to 
find sin(a — f) in terms of sina, cosa, sinf, and cos $. Test your identity with a = 120° and £ = 60°. 


Solution for Problem 3.8: We write sin(a — f) as sin(a + (—f)) and use the identity for the sine of a sum of angles: 
sin(a — B) = sin(a + (—£)) = sina cos(—f) + sin(—) cos a. 
Now, we apply sin(—f) = — sin f and cos(—f) = cosf to the right side to find: 


sin(a — f) = sina cos f — sin f cos q. 


If we let a = 120° and f = 60°, we have sin(a — f) = sin 60° = 48 an 


3 il 3 
sina cos f — sin f cosa = sin 120° cos 60° — sin 60° cos 120° (Oe = -(¥) (-5) = sil 


Now, we turn to cosine. 


EEE 
| Problem 3.9: Find identities for cos(a — 8) and cos(a + f). j 


Solution for Problem 3.9: We already have angle awo and difference identities for sine, so we look for ways to relate 
sine and cosine. Two identities come to mind, sin? 6 + cos? 0 = 1 and sin(90° — 0) = cos 0. The first doesn’t look 
immediately helpful, because squaring the identities we already have for sin(a + f) and sin(a — $) looks scary. So, 
we focus instead on sin(90° — 0) = cos @, which gives us 


cos(a — f) = sin(90° — (a — B)) = sin(90° — a + p). 
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Now we have the sine of a combination of three angles. By grouping the first two, we can write it as the sine of a 
sum of two angles, which we know how to handle: 


cos(a@ — B) = sin(90° — a + B) = sin((90° — a) + 8). 


We can now apply the identity 
sin(x + y) = sin x cos y + sin y cos x. 


By letting x = 90° — a and y = f, we have 
cos(a — B) = sin((90° — a) + B) = sin(90° — a) cos B + sin B cos(90° — a). 
Finally, since sin(90° — a) = cos a and cos(90° — æ) = sina, we have 
cos(a — 8) = cos æ cos f + sina sin f. 
We can either do essentially the same thing for cos(a + f), or we can write cos(a@ + $) as cos(a — (—f)) and use 
the identity we just proved: 
cos(a + f) = cos(a — (—f)) 


= cos a cos(—f) + sina sin(—f) 


= cos acosf —sina sin f, 


where we used cos(—f) = cos f and sin(--6) = -sin f in the last step. 5 


Finally, we tackle tangent. 


Solution for Problem 3.10: We already know how to handle sine and cosine, so we write tangent in terms of sine 
and cosine, and use the identities we've already proved: 


sin(a +f) _ sina cos f + sin f cosa 


tan(a + p) = cos(a + 6) ya cos acos f — sina sin B 


Now what? We want an identity in terms of tana and tan $, but we just have sines and cosines. 


There are a few ways we can reach our desired identity. We could divide the numerator and denominator of the 
our huge fraction above by cos a cos f (make sure you see why this works), or we can note that since tana = 524, 
we have sina = tana cosa. Similarly, we have sin f = tan $ cos f, and we can write our identity as: 


Hues = sin a cos B + sipao 
cos & cos f — sina sin B 

tana cos & cos f + tan f cos f cos a 

cos & cos f — tana cos & tan f cos B 

_ (cosacos f)(tan a + tan $) 


~ (cosa cos f)(1 — tana tan $) ` 
Because tana and tan f are both defined, we know that cosacosf + 0. Therefore, the cosa@cosf terms in the 
numerator and denominator cancel and leave 
tana + tan f 


A aati = 1- tana tan f` 


As an Exercise, you'll prove the corresponding identity for tan(a — f). 
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WARNING! The identity 
Aà 

tana + tan f 

— tana tan f 

only holds when 1 -tan a tan $ + 0 and when both tan a and tan are defined 


(that is, when cosa + 0 and cos + 0). If tana tan 6 = 1, then tan(a + $) is | 
undefined. 


tan(a + $) = 


Let's take a closer look at why tan(a + p) is undefined when tana tanf = 1. From tana tang = 1, we as 
sin a sin B = cos & cos £, so cosa cos B — sina sin $ = 0. The left side equals cos(a + 8), which means cos(a + B) = 
so tan(a@ + f) is undefined. 


We now have identities for the sine, cosine, and tangent of a sum or difference of two angles: 


Important: Angle sum and difference identities: 


sin(a + p) = sina cos $ + sin f cos q, 


sin(a — f) = sin æ cos $ — sin $ cos g, 


cos(a + f) = cos æ cos p — sin a sin f, 


cos(a — f) = cosacosf + sina sin f, 


tan æ + tan f 
1 — tan a tan p’ 
tana — tan £ 

1+ tana tanp 


tan(a + B) = 


A “tie 


These identities are sometimes expressed in more concise form using + and F: 


sin(a + B) = sinacosf + sin f cosa, 
cos(a + f) = cos æ cos f F sina sin f, 
tana + tan f 

1+ tanatanp 


tan(a + f) = 
The + symbols on the right sides mean “match the sign of the + on the left,” while the + symbols on the right 
sides mean “take the opposite of the sign of the + on the left.” 


Don’t bother memorizing all of these now. As you've just seen, if you only have sin(a + f) memorized, you can 
derive all the rest pretty quickly. Moreover, in Chapter 7, we'll learn an even faster way to recall these identities. 


If you aren’t quite sure you have remembered an identity correctly, you can easily 
test your memory by choosing easy angles to substitute into your identity. For. 
example, if you aren’t sure whether sin(a + p) equals sinacosf + sin f cosa or 
sin a cos f — sin f cosa, then letting a = 0° and $ = 90° in both will quickly rule out 

the latter. - l 


Problem 3.11: 
(a) Find cos15°. 
(b) Find sin Ba, 


CHAPTER 3. TRIGONOMETRIC IDENTITIES 


Solution for Problem 3.11: 


(a) If we start in the usual way, by building a right triangle on the unit circle, we end up with a right triangle 
with acute angles with measures 15° and 75° and hypotenuse length 1. Unfortunately, we don’t know 
anything (yet!) about the sides of such a triangle. However, we might be able to use our new angle sum 
and difference identities to express 15° in terms of angles whose sine and cosine we know. We list a few of 
the angles we know how to handle: 0°, 30°, 45°, 60°, 90°. Aha! We have 45° — 30° = 15°, so we can use our 
difference of angles formula: 


cos 15° = cos(45° — 30°) 
= cos 45° cos 30° + sin 45° sin 30° 


_ v2 v3 V2 1 
Ri” DD 


_ V6 + v2 
=o. 


(b) We know that adding fractions of the form å and 2 can give us a fraction with 12 as the denominator, and 
we know how to handle sine and cosine of angles like F and z So, again, we look over the angles we know 
how to handle, writing them with denominator 12, and try to find a pair we can combine to get 188; 3, $ 


12/ 12" 
ox 8x 9" Success! We have 4% + $2 = 1r so: 


To we o 12 

, 13n . {(9n An Gyre N 

in san 28 88) aan (282) 
e coe | ea Seems 

4 3 3 4 

a 
oo £. (-2) 
v2- vē 


As a quick check, we can note that this answer is negative, which is what we should expect, since the sine 
of a third quadrant angle must be negative. 


Problem 3.12: Find the value of cos (arcsin $ + arctan 3) without using a calculator. 


Solution for Problem 3.12: If we could calculate the angles that equal arcsin ¢ and arctan $, we could simply add 
these angles and take the cosine of the result. Unfortunately, we can’t find these two angles without a calculator. 


However, we saw in Section 2.5 that we can calculate expressions like cos (arcsin 4), and we can use the angle 


sum formula to break cos (arcsin $ + arctan 5) into expressions like cos (arcsin 4) ; 


( y t 2) = cos (arcsin £) cos (arctan =.) ~ sin ( in =) sin ( t Z) 
cos arcsin z arctan +, = cos | ar sin = cos C I7 sin arcsin = sin { arc an T7 


One of these expressions is pretty easy: by the definition of arcsin, we have sin (arcsin 4) = 2. Let's tackle 


cos (arcsin 4) next. We seek cos 0, where sin 0 = $ and @ is in the first quadrant (since the arcsine of a positive 
number less than 1 is a first quadrant angle). We’ll take two approaches: 


Method 1: Use trig identities. We have cos? 0 = 1-sin? 0 = 1- 52 = 55. Since @ is in the first quadrant, cos? 0 = & 


i = 3 int) =3 
gives us cos 0 = =. So, cos (arcsin 4) =% 
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Method 2: Use the geometric interpretation of sine and cosine. Suppose AABC is a right A 
triangle with ZC = 90° and ZA = @, as shown at right. Then, we have sin@ = 2E and 


cos 0 = ae. Since we know that sin@ = 2, we let BC = 4 and AB = 5. The Pythagorean 3 5 
Theorem then gives us AC = 3, so cos 0 = 2. (We also could have simply recognized the 
Pythagorean triple {3, 4, 5}.) a 

C 4 B 


Next, we tackle cos (arctan 5) and sin (arctan 5). We let @ = arctan Žž, so tang = 2, 
and 0 < } < 4. Again, we present two solutions: 


Method 1: Use trig identities. We could write tan @ in terms of sin ọ and cos ¢, solve for sin@ in terms of cos Q, 
and then use sin” ġ + cos? p = 1. Instead, we'll use sec? @ = tan? ọ + 1 to find 


5X7 25 169 
2 = — = —— = Sse 
sec = (3) +I iat} iad 


Since 0 < @ < 3, we know that seco > 0, so we have sec = #3, which gives us cos = 43. Finally, we have 
sin? $ =1-cos*¢ = 2,80 sind = 5. Therefore, we have cos (arctan 5) = # and sin (arctan Š) = 3. 
Method 2: Geometry. Let AXYZ be a right triangle with /Z = 90°, ZY = 5, Y 


XZ = 12, and 2X = ọġ, as shown at right. We have tang = xg = 2. The 


Pythagorean Theorem (or knowing the {5, 12,13} Pythagorean triple) gives us is 13 
XY = 13, so sin ọ = x = 3 andcos¢ = 4% = E, 
c 
Finally, we have zZ 12 X 
( ne + arct 2 ) = ( j = | cos (arctan =) — sin (arcsin =) sin. (arctan 2) 
cos | arcsin 5 arctan ea cos | arcsin 5 o P 5 I 

-9 12_4 5 
5 13 513 
_ 16 
65` 


Solution for Problem 3.13: The graph appears to have period 27 and amplitude 5, which might make us first guess 
that it is the graph of y = 5sinx. The given graph doesn’t pass through the origin, so it can’t be the graph of 
y = 5sin x; however, it might be the result of shifting the graph of y = 5 sin x horizontally. If so, then it is the graph 
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of y = 5sin(x + 0) for some angle 0. If this graph is also the graph of y = 3sinx + 4cos x, then we must have 
5sin(x + 0) =3sinx+4cosx 


for all x. Is that possible? Is there any value of 0 for which this is true? 


It’s not clear how we could isolate 0, but we can at least isolate sin(x + 0) by dividing both sides by 5, which 
gives 


4 
sin(x + 0) = = sinx + 5 COS X. 


3 4 . aa TE 4 ; 
Aha! We saw $ and = in the previous problem. We have (3) + ( 4) = 1, so we can view = and ; as the cosine 
and sine of some angle. Using our identity for sin(x + 0) tells us just what angle: 
sin(x + 0) = sin x cos 0 + sin @ cos x. 


So, if we let 9 = arccos z which means cos 0 = 2 and sin 0 = $, then we do have sin(x + 0) = 2 sin x + $ cos x, SO 
5sin(x + 0) = 3sin x + 4 cosx. 


Mystery solved! 0 


Problem 3.14: Compute x if arctan x + arctan 1 = 2(arctan x — arctan $). (Source: ARML) 


Solution for Problem 3.14: First, we simplify the equation. We seek x; while it isn’t immediately obvious how to 
isolate x, we can isolate arctan x. Doing so gives us 


1 
arctan x = arctan 1 + 2 arctan 3" 


We have arctan 1 = 7, which gives us 


arctan x - + 2arct 1 
=—- rctan —. 
4 3 


But now what? 


We'd like to solve for x; now that we've isolated arctan x, we can isolate x by taking the tangent of both sides, 
which gives us 


T 1 
tan(arctan x) = tan (= + 2 arctan 5) ‘ 


By definition, we have tan(arctan x) = x. We have the tangent of a sum on the right, so we can apply the identity 


tana + tanp 
tan(a + B) = ———_——.. 
=i p) 1 — tana tan f 


We then have 


tan i + tan (2 arctan 1) 


= a 
1 — tan a tan (2 arctan 1) 


Since tan 2 = 1, we have 
4 


1+ tan (2 arctan 1) 
— 
1 
3 


1 -— tan (2 arctan ) 
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We're almost there—all we have to do is compute tan (2 arctan 1). Fortunately, the angle sum identity for tangent 
can do the job again: 


tan (2 arctan 5) = tan (arctan 5 + arct 5) 
a al 3 arctan 3 


tan (arctan 1) + tan (arctan 1) 


~ 1—tan (arctan 1) tan (arctan 3) 


en, i 
-A73 


1.1 
33 


Now we can find x: 
1+ tan (2arctan 1) 1+ 3 


3 


o 


[[_____Exercises E 
3.2.1 Use the identities we learned in this section to write each of the following in terms of cos 0, sin 0, or tan @. 
(a)  sin(z +9) (c)  cos(37 — 0) 
(b) cos (z + a) (d) tan (9 a z) 


tana — tan f 


3.2.2 Show that tan(a = p) = 1+ tanatanp 


3.2.3 Evaluate tan 285°. 
3.2.4 Prove that sin(a — $) sin(a + $) = sin? a — sin’ $. 
3.2.5 Find all acute angles x such that sin 2x sin 3x = cos 2x cos 3x. (Source: AHSME) 


3.2.6 Angles A and B are acute angles such that tanA + tanB + tanAtanB = 1. If A -B = 41°, compute the 
measure of angle A. (Source: NYSML) 


3.2.7 Suppose that sina + sinb = ¥5/3 and cosa + cosb = 1. What is cos(a — b)? (Source: AMC) 


3.2.8x In the text, we proved that sin(a + $) = sin a cos f + sin f cos æ only for the case where a and f are acute 
angles such that a + f is also acute. In this problem, we explore the type of careful casework we can use to prove 
that this identity holds for all angles. We will prove that the identity holds if a and f are acute but a + $ is obtuse. 


(a) Lety = %-aand ọ = 5 — $. Why must y and ọ be acute? 

(b) Write sin(a + $) in terms of y, ọ, and 7. 

(c) Show that sin(a + f) = sin(y + Q). 

(d) Express sin(y + $) in terms of sina, sin £, cosa, and cos f to complete the proof of the identity for the case 
in which a and £ are acute but a + $ is obtuse. Why is it important in this step that y and @ are acute? 


(e)x Show that the identity holds if a and £ are both obtuse. 
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3.3 Double and Half Angles 


In Problem 3.14, we used the identity for the tangent of a sum of angles to evaluate tan (2 arctan 4). In other 
words, we used the angle sum identity to evaluate the tangent of double an angle. Double angles and half angles 
occur frequently in trigonometry; in this section we develop and apply identities for trig functions of both. 


: Problems 


Problem 3.15: Find sin2x and cos2x in terms of sinx and cosx. Test your identities by finding sin 120° and 
cos 120° with them. 


| Problem 3.16; Find tan 2x in terms of tan x. | 


Problem 3.17: Use your identity for cos 2x in Problem 3.15 together with sin? x + cos? x = 1 to find cos(x/2) 
and sin(x/2) in terms of cos x. Test your identities by finding cos 120° and sin 120° by letting x = 240° in your 
identities. 


Problem 3.18: Find cos 15° using an identity you found in the previous problem. Compare your answer to the 
result in Problem 3.11. 


sin x _ l= cosx 
2 1+cosx sin x 


Problem 3.19: Show that tan 2m 


Problem 3.20: Compute the number of degrees in the smallest positive angle x that satisfies the equation 
8 sinx cos? x — 8sinf xcosx = 1. (Source: ARML) 


Problem 3.21: Find (cos 20°)(cos 40°)(cos 80°). 


Problem 3.15: Find sin 2x and cos 2x in terms of sin x and cosx. Test your identities by finding sin 120° and Ẹ 


cos 120° with them. 


Solution for Problem 3.15: Applying the angle sum identities for sine and cosine, we have 


sin 2x = sin(x + x) = sinxcosx + sin x cos x = 2sinxcosx, 


cos 2x = cos(x + x) = cos x cos x — sin x sin x = cos? x — sin’ x. 
We can use the identity sin? x + cos* x = 1 to write cos 2x solely in terms of cos x or in terms of sin x. Substituting 
cos? x = 1 — sin? x gives 
cos 2x = cos? x sin x = (1 — sin? x) — sin? x = 1—2sin’ x, 
and substituting sin? x = 1 — cos? x gives 


cos 2x = cos? x — sin? x = cos” x — (1 — cos? x) = 2cos* x — 1. 


We can test these by letting x = 60°. Then, we have sin 2x = sin 120° = 3 and 2sinxcosx = 2: 2 $= = We 
also have cos 2x = cos 120° = —} and cos? x — sin? x = 1 — 3 = —}. So, both our identities work for x = 120°. g 
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Problem 3.16: Find tan 2x in terms of tan x. 


Solution for Problem 3.16: Applying the angle sum identity for tangent, we have 


tanx+tanx _ 2tanx 


ae ees ee 


Important: Double angle identities: 


W sin 2x = 2 sin xcosx, 


2 2 


cos 2x = cos? x — sin? x = 2 cos? x — 1 = 1 — 2 sin? x, 
2tanx 

tan 2x = ——. 

1 — tan‘ x 


You'll use these identities enough that they're worth memorizing. (In fact, you'll likely use them so much in 
this chapter that you'll have them memorized by the end of the chapter.) 


| Problem 3.17: Find cos 3 and sin À in terms of cos x. i 


Solution for Problem 3.17: We don’t yet have any identities involving half angles. However, we do have double 
angle identities, which means we can express a trig function of an angle in terms of trig functions of half that 
angle. For example, if we start with 

sin2@ = 2sin 0 cos 0, 
and let 6 = 5, we have 

inx = 2sin = cos =. 

sin x 5 98 5 
We now have an angle of 5, but we unfortunately have both the cosine and sine of this angle. This happened 
because our identity for sin 20 includes both sin 6 and cos 0. We'd prefer to use a double angle identity that only 
has one of sin 9 or cos 0, not both. Fortunately, cos 20 offers such identities: 


cos 29 = 2cos?@ -1=1-2sin’ 0. 
Letting 0 = % in cos 20 = 2 cos* 0 — 1 gives 


xX 
cos x = 2 cos? ~ 1. 


oe 1+cosx 
cos 5 = | ; 


The + here does not mean that cos 4 has two values! It means that we will take either the positive or the negative 
square root, depending on what quadrant 5 is in. For example, if x = 240°, then ž = 120°, which is a second 
quadrant angle. Therefore, cos 120° is negative, and we must take the negative root in the identity. Letting x = 240° 


in the identity gives 
T J1 +cos240° ey m 
cos = > 2 = 2 = 4 = a 


Solving for cos 5 gives 


as expected. 
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We deal with sin % by letting 0 = 4 in cos20 = 1 — 2 sin? 6, which gives us 
cosx =1-2sin? x 


Solving for sin 5 gives 


Once again, the + is a reminder that we must take the positive or negative root depending on the quadrant of 
5. We again consider the example x = 240°, which makes 5 = 120°, a second quadrant angle for which sine is 
positive. Therefore, we have 


cesta 1- (-4) T V3 
Se e a ay ey 
2 2 i 3 


Let’s try putting our half angle identities to work on an angle we can’t tackle with the unit circle. 


as expected. O 


Problem 3.18: Find cos 15° using an identity you found in the previous problem. Compare your answer to 


the result in Problem 3.11. 
Solution for Problem 3.18: Earlier, we used the angle difference identities to find cos 15° = cos(45° — 30°) = e 


We now can use half angle identities to compute cos 15° as cos %. Since 15° is a first quadrant angle, we take the 
positive root, and we have 


a 30° 1 + cos 30° 1238 2443 4/2 43 
cos 15 -o a = 5 aS Get ae 


Uh-oh! This doesn’t look anything like the result we found with cos(45° — 30°). Did we do something wrong? 


If we calculate bo and AnaL with a calculator, it appears that the two are equal. They sure don’t look 
equal! To confirm by hand that they are equal, we get rid of some of the square roots by squaring both numbers: 


k oeeo 
a 


6+V2 
ty ES 


16 16 
(AF5) a 
2 a. i 


Now we see the whole story. Our two results for cos 15° are positive and have equal squares, so the two must be 
the same. O 


sin x m 
2 1+cosx 


Problem 3.19: Show that tan = 


Solution for Problem 3.19: We already have cos 5 and sin 3, so we can find tan 5: 


1—cos x 
If > $ 5 V1 — cosx 


x 8 
tan > = 5—1 I i : 


2 cosy = 4, /Ltcosx V1+cosx 
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Again, we include + to indicate that we we'll use the quadrant of 5 to determine whether to take the positive or 
the negative result. We can simplify our expression a bit by getting rid of the square root in the denominator. We 
do so by multiplying the numerator and denominator by V1 + cosx. This gives us 

ree Vi cose yal + cos 
2 Vl+cosx V1+cosx 
ee Vi — cos x)(I + cos x) 


1+ cosx 
V1 — cos? x 


1+cosx ` 


2 


We recognize 1 — cos? x as equal to sin? x, so we can simplify even more: 


x 4 Nic cosir _ Vsin? x = sin x 


= s dpe 
2 1+cosx 1+cosx 1+cosx 


We should worry about what happens if the denominator on the right side is 0. If 1 + cosx = 0, then cosx = —1, 
which means x is an odd integer multiple of n. For these values of x, tan ¥ is indeed undefined, because cos 4 = 0 
whenever x is an odd integer multiple of n. 


We might think we're finished, but then we notice that the problem claims that 


; X sinx 
an- = ———. 
2 1+cosx 


Our result above has a + in front of the right side. Is it true that we can drop the +? The value of 1 + cos x is always 
nonnegative, since cosx = —1. So, to compare the sign of tan 5 to the sign of -== , we need only compare the 


2 1+cos x’ 
signs of tan 5 and sinx. We investigate by considering all possible values of x. 


If sinx = 1 or if x is in the first or second quadrant, then x = 0 + 2kn for some integer k and some 0 with 
0 <0 < n. Then, sin x is positive, and 5 = g +kn. Since 0 < 2 < 7, the angle 2 +kr is in the first or third quadrant, 
which means tan 5 is also positive. 


If sinx = —1 or if x is in the third or fourth quadrant, then x = 0 + 2kn for some integer k and some 0 with 
nv < 0< 2r. This means sinx is negative and 5 = S + kn. Since = < g < 7, the angle 2 + kr is in the second or 
fourth quadrant, which means tan ž is also negative. 


Finally, we note that if x is an even multiple of z, then tan 5 = 0 and sin x = 0. (We noted above that if x is an 
odd multiple of z, then both tan 5 and (sin x)/(1 + cos x) are undefined.) 


The sign of tan 5 therefore matches that of sin x for all possible values of x for which tan 3 is defined, so we 


have : 
i 5 sin x 
an = = ————.. 
2 1+cosx 


We multiply the numerator and denominator of the right side by 1 — cos x to produce the other desired identity: 
X sinx 1-cosx _(sinx)(1- cosx) _ (sinx)(1- cosx) _ 1- cosx 


tan — = ————: = : 
2 1+cosx 1-cosx 1 — cos? x sin? x sin x 


Extra! Challenge: Prove that 
me 1m a a arctan 1 + arctan 2 + arctan3 = n 


by drawing three well-chosen triangles on the Cartesian plane. Answer on page 110. 
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Important: Half angle identities: 


O 1+cos@ 
COS 5 = = (/ a 
aa ee 1—cos@ 
2 pe 
anf = Sin @_ _ 1-cosé | 
2 1+cos0  sin@ 
The + symbols on the first two are reminders that we must use the quadrant of 
2 to determine the appropriate sign, 


If you know the double angle identities, then you should be able to quickly derive these half angle identities 
whenever you need them. 


Problem 3.20: Compute the number of degrees in the smallest positive angle x that satisfies the equation § 


8sinxcos° x — 8sin° xcosx = 1. (Source: 


Solution for Problem 3.20: Ultimately, our goal is to isolate a trig function, but the expression on the left is pretty 
complicated. Most intimidating are the fifth powers. We start chipping away at these in the same way we deal 
with high degree polynomials, by factoring. First, we factor out 8 sin x cos x, and we have 


8 sin x cos x(cos* x — sinf x) = 1. 


Now, we have a difference of squares: we can factor cos* x — sint x as 


cost x — sint x = (cos? x}? — (sin? x)? = (cos? x — sin? x)(cos? x + sin? x). 


4 2 2 


tx — sinf x = cos? x — sin* x, which means 


Moreover, cos? x + sin? x = 1, so we have cos 
8 sin x cos x(cos* x — sin* x) = 8 sin x cos x(cos? x — sin? x). 


Our equation now is 
8 sin x cos x(cos? x — sin? x) = 1. 


We've at least reduced the powers of the trig functions, but we still have some work to do. We'd like to simplify 
the equation until we have just a single trig function set equal to a constant. We recognize some expressions 
on the left side that might help us continue simplifying. Specifically, we recognize sin x cos x from the double 


angle identity for sine, and cos? x — sin? x from the double angle identity for cosine. Applying these identities “in 


2 


reverse” gives us sin x cosx = i sin 2x and cos? x — sin? x = cos 2x. Now, our equation is 8 (3 sin 2x) cos 2x = 1, or 


4sin2x cos 2x = 1. 


Once again, we have the product of sine and cosine of the same angle, which makes us think of the double 
angle identity for sine. We have sin 2x cos 2x = 5 sin 2(2x) = i sin 4x, so now our equation is 4 (3 sin 4x) =1,or 
2sin4x = 1. Therefore, we have sin4x = 5. We can use what we know about sine to finish. (This is why we 
wanted to manipulate the equation into a single trig function set equal to a constant.) The smallest positive angle 


with sine equal to is 30°, so we have 4x = 30°, from which we find x = 5°, o 


The key step in our solution is recognizing that expressions in the problem match the forms of the double 
angle identities. 


Concept: Recognizing the forms of common trigonometric identities can help solve many 
Q= trigonometric problems. | 
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Problem 3.21: Find (cos 20°) (cos 40°)(cos 80 


Us ‘ 


Solution for Problem 3.21: This problem doesn’t match the form of any of our identities, but the facts that 80° = 2-40° 
and 40° = 2-20° make us try applying the double angle identities. We have cosines, so let’s try applying the cosine 
double angle identity to the latter two cosines: 


(cos 20°)(cos 40°)(cos 80°) = (cos 20°)(2 cos? 20° — 1)(2 cos? 40° — 1). 
Yuck. And if we apply this again to the cos 40° on the right, it will get even uglier. 


Back to the drawing board. Those angles really make us want to use the double angle identities. The cosine 
double angle identity was a loser, but there is a cosine in the sine double angle identity. For example, we have 


sin 40° = 2 cos 20° sin 20°, 


sin 40° 
20° = l , SE , 
SO COS AT Making this substitution gives us 


sin 40° 


(sin 40°)(cos 40°)(cos 80°) 
2 sin 20° 


20° 40° 80° =| 
(cos 20°) (cos 40°) (cos 80°) 2sin 20° 


) (cos 40°)(cos 80°) = 


At first, this doesn’t look helpful, but then we see the product of sine and cosine of the same angle, which means 
we can use the sine double angle identity again. Running this identity “in reverse,” as in Problem 3.20, we have 
(sin 40°)(cos 40°) = sing’ so we have 


T (sin 40°)(cos 40°)(cos 80°) B (sin 80°)(cos 80°) 


(cos 20°) (cos 40°) (cos 80°) 2 Sin 20° Asina 
We can do it again! Since (sin 80°)(cos 80°) = 5 sin 160°, we have 


(sin 80°)(cos80°) _ sin 160° 
4sin20° ~ 8sin 20°’ 


(cos 20°)(cos 40°)(cos 80°) = 


Now we need to relate sin 160° and sin 20°. We got this far by recognizing relationships among angles in the 
problem. We do the same here. We note that 160° + 20° = 180°. So, we apply the identity sin(180° — x) = sin x to 
see that sin 160° = sin 20°, which means these two cancel in the fraction above and leave the surprising result 


1 
(cos 20°)(cos 40°)(cos 80°) = 3° 


Concept: Recognizing relationships among angles in a complicated trig expression can help 
Q=» simplify the expression. 


[exercises ie 


3.3.1 Evaluate sin 22.5°. 


3.3.2 lfsin@ = i, then what is cos 26? 


2tanx 


Sa for all x for which tan x is defined. 
1+tan*x 


3.3.3 Prove that sin 2x = 
3.3.4 Show that tanx + cotx = 2csc 2x. 
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sin2A —cos2A +1 
sin2A +cos2A +1 


3.3.6 Ifsin2@ = 21/25 and cos 0 > sin 0, then compute cos 0 — sin 8. (Source: Mandelbrot) 


3.3.5 If-7/2< A< n/2 and A + —-7/4, then prove that = tan A. (Source: CEMC) 


3.3.7 Solve the equation cos 0 + sin 0 = y3/2 for 0 < 0 < 27. 
3.3.8 In this problem, we prove the triple angle identities for sine and cosine. 
(a) Prove that cos 30 = 4 cos? 0 - 3 cos @ for all angles 0. 


(b) Find a similar identity for sin 30 by expressing sin 30 in terms of sin 0. 


cos 3x 
)* 5 
(c) uppose — 


ae Source: USAMTS) 


i 
= 3 for some angle x, with 0 < x < Z Dewan © 
sin 


3.4 Sum-to-product and Product-to-sum 


|) Problems BES 


Problem 3.22: 
(a) Express cos acos f in terms of cos(a@ + f) and cos(a — f). 


(b) Find a similar identity for sina sin £. 


(c) Find a similar identity for sina cos $. 


Problem 3.23: In Problem 3.21, we proved that (cos 20°)(cos 40°)(cos 80°) = $ using double angle identities. Use 


one of the identities from Problem 3.22 to find another proof. 


Problem 3.24: 
(a) Use an identity from Problem 3.22 to show that cos x + cos y = 2 cos (3) cos (5%). 


(b) Find a similar identity to that in part (a) for cos x — cos y. 


(c) Find identities for sinx + sin y and sinx — sin y. 


sin 13° + sin 47° + sin 73° + sin 107° 


mos 17° . (Source: ARML) 


Problem 3.25: Compute 


Problem 3.22: 3.22: 
(a) Express cosa cos f in terms of cos(a + f) and cos(a — p). 


| 
| 
| (b) Find a similar identity for sina sin £. 


(c) Find a similar identity for sina cos $. 


Solution for Problem 3.22: 


(a) We first write out the angle sum and angle difference identities for cosine: 


cos(a + B) = cos æ cos f — sina sin f, 
cos(a — fp) = cosa cos f + sina sin $. 
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Adding them cancels sin a sin £ and leaves us with cos(a + p) +cos(a —B) = 2cosacos $. Dividing by 2 gives 
i 
cos a. cos B = 5 (cos(a + B) + cos(a — B)) : 


(b) If we subtract cos(a — $) from cos(a + $) instead of adding, then we cancel cos a cos B, and leave 
cos(a + $) — cos(a — f) = -2 sin a sin £. 
Dividing by —2 gives 
sina sin f = ; (cos(a — $) — cos(a + B)) . 


(c) We don’t have a product of sine and cosine in our angle sum or difference identities for cosine, but we do 
have them in the corresponding sine identities: 


sin(a + B) = sina cos $ + sin $ cosa, 


sin(a — f$) = sina cos $ — sin $ cosa. 


Adding these and dividing by 2 gives us 


sin a cos B = ; (sin(a + $) + sin(a — B)). 


We call the identities we found in Problem 3.22 product-to-sum and product-to-difference identities. 


| Important: Product-to-sum and product-to-difference identities: 


cos a. cos B = ; (cos(a@ + B) + cos(a — p)) , 
sina sin B = 5 (cos(a — p) — cos(a + B)), 


B): 


sina cos f = ; (sin(a + $) + sin(a — 


Again, there’s nothing to memorize here: these are a quick, direct result of the angle sum and angle difference 
identities. The main idea here is that we can turn a product of cosines or sines into a sum or difference. Sometimes 
this can be particularly convenient, especially when applying the identity gives us angles we know how to handle. 


| Problem 3.23: In Problem 3.21, we proved that (cos 20°)(cos 40°)(cos 80°) = A using double angle identities. 
| Use one of the identities from Problem 3.22 to find another proof. 


Solution for Problem 3.23: We now have an identity for the product of two cosines, but in this problem, we have 
the product of three cosines. To which two should we apply the identity? 


Concept: If a problem involves the product of sines and/or cosines of angles with a conve- 
Q nient sum or difference, consider the product-to-sum identities. 


We apply the identity to (cos 40°)(cos 80°), since the angles in these have sum 120°, which we know to 
handle inside trig functions. (See if you can find another solution by applying a product-to-sum identity to 
(cos 20°)(cos 40°).) We have 


1 1 al 
(cos 80°)(cos 40°) = > (cos(80° — 40°) + cos(80° + 40°)) = 5 (cos 40° + cos 120°) = 5 (cos 40° — >) , 
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so now we have i i 
(cos 20°)(cos 40°)(cos 80°) = 5 (cos 20°) (cos 40° — >) : 


If we expand the product on the right, we'll have another product of cosines with a convenient sum. 


Concept: Ifa tactic makes some progress on a problem, but doesn’t entirely solve it, look 


for a way to use the tactic again. 


Expanding the product gives us 


1 il 1 1 
(cos 20°) (cos 40°)(cos 80°) = z (cos 20°) (cos 40° — 5) = 5 (cos 40°)(cos 20°) — z 008 20° : 


Applying a product-to-sum identity to (cos 40°)(cos 20°) gives 
o o 1 (e) o o Q 1 1 o 
(cos 40°)(cos 20°) = 5 (cos(40° + 20°) + cos(40° — 20°)) = z (7 +008 20°), 
so we now have 


1 
(cos 20°) (cos 40°)(cos 80°) = ((cos 40°)(cos 20°) — z 008 20° 


1/1 X 1 rs 
(5 (5 + cos 20 ) — z 608.20 ) 


1 1 el m 
(4 + 5 c0s20 - 50820") 


Cle NIK NIe NIe 


= 


The two very different-looking proofs of (cos 20°)(cos 40°)(cos 80°) = 3 exemplify how deeply related trigono- 
metric identities are to each other. Many trig problems can be solved in a variety of ways using different 
combinations of identities, so don’t get locked into looking for the solution—there may be several approaches that 
work. 


Problem 3.24: 


(a) Use an identity from Problem 3.22 to show that cos x + cos y = 2cos (54) cos (5) : 


(b) Find an identity similar to that in part (a) for cos x — cos y. 


(c) Find identities for sin x + sin y and sin x — sin y. 


Solution for Problem 3.24: 


(a) We already have an identity that relates a product of cosines to a sum of cosines: our product-to-sum 
identity. We’ll write it with the sum isolated on the left, since that’s the form of the desired identity in this 
problem: 

cos(a + B) + cos(a — $) = 2 cos & cos b. 


To convert this to an identity for cos x + cos y, we simply let a + 6 = x and a — f = y. Adding these gives 
2a = x + y, so a = (x + y)/2. Subtracting a — 6 = y froma + p = x gives $ = (x — y)/2. Substituting these 
expressions for a and f in the identity above gives the desired 


cos xX + cos y = 2 cos (=) cos (=). 


` 
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(b) The previous part gives us a clear path to find an identity for cos x — cos y. We start with 
cos(a — p) — cos(a + $) = 2 sin a sin B. 


Substituting x = a + $ and y = æ — $ in this identity, so that a = (x + y)/2 and $ = (x — y)/2 as before, gives us 


cos y = cosx = 2sin (==) sin (=) , 


Or 


cos x — cos y = —2sin (=) sin = : 


(c) For these, we turn to the product-to-sum identity that features a sum of sines. As before, we isolate the sum 
on the left: 


sin(a + $) + sin(a — p) = 2 sin æ cos £. 


Once again, we use the substitution x = a + $ and y = a — £, so a = (x + y)/2 and B = (x — y)/2, and we have 


3 2 + = 
sinx + siny = 2sin (=) cos (= 1). 
2 2 
We don’t have an identity with a difference of sines among those we found in Problem 3.22, but we can 
convert the sine sum above to a sine difference by noting that sin(—z) = — sinz. So, letting y = —z in the sine 


sum identity above gives us 
3 : a ee 
sinx — sinz = 2 sin (=) cos ( 


<5) 
5 0 


Important: | Sum-to-product and difference-to-product identities: 


cos x + cos y = 2 cos (=) cos Ey, 


cosx — cos y = —2 sin (= sin = 
sinx +siny = 2sin (=) cos (ai 


: : . [X 
sinx — sin y = 2 sin 


As with many of the other identities we've studied, you don’t need to memorize these. We’ve seen that these 
are a direct result of the product-to-sum and product-to-difference identities, which are in turn a straightforward 
manipulation of the angle sum and angle difference identities. 


sin 13° + sin 47° + sin 73° + sin 107° 


. (Source: ARML) : 
cos 17° 


Problem 3.25: Compute 


Solution for Problem 3.25: This is a natural candidate for sum-to-product identities, both because we have a 
sum of sines in the numerator, and because the angles have convenient sums. But how should we pair up the 
sines to apply the sum of sines identity? We could pair the first two and the last two, since 13° + 47° = 60° and 
73° + 107° = 180°, or we could pair the outer two and the inner two, since 13° + 107° = 47° + 73° = 120°. Looking 
at the differences of the paired angles gives us the answer. In our first option, these differences are both 34°, while 
in the second option, we have 107 — 13° = 94° and 73° — 47° = 26°. While 34°, 94°, and 26° are all inconvenient, 
we do note that 34° is double the 17° in the denominator. 
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Concept: In problems involving several trig functions of inconvenient angles, look for 


relationships among the angles. 


Applying the sine sum-to-product identities to sin 13° + sin 47° and sin 73° + sin 107°, we have 


sin 13° + sin 47° + sin 73° + sin 107° _ 2sin 30° cos 17° + 2 sin 90° cos 17° 


cos 17° cos 17° 
cos17° (2-3+2-1) 
p cos 17° 
= e 


o 


| _txercises Ea 


3.4.1 Show that (cos 4x — cos 2x)/(2 sin 3x) = — sin x for all x such that sin 3x + 0. 
3.4.2 Find the acute angle @ such that cos 27° + cos 33° = V¥3.cos 0. 
3.4.3 Consider the following two identities: 

sin(a + f) = sing cos f + sin f cosa, 

sin(a — B) = sin & cos f — sin f cos q. 


If we subtract the second equation from the first, we get an identity for sin f cos æ. But we already have an identity 
for sina cos f, and it doesn’t look just like the result we get for sin f cos æ when we subtract above. Explain. 


3.4.4 Express 
sin 10° + sin 20° + sin 30° + sin 40° + sin 50° + sin 60° + sin 70° + sin 80° 
cos 5° cos 10° cos 20° 
without using trigonometric functions. (Source: HMMT) 


3.4.5 Prove that 

4sin A sin BsinC = —sin(A + B+ C) + sin(B + C — A) + sin(C +A — B) + sin(A + B — C) 
for all angles A, B, and C. 
3.4.6x Find the smallest positive integer solution to 


anoa cos 96° + sin 96° 
~ cos 96° — sin 96° ` 


(Source: AIME) Hints: 282, 24 


` Extra! The diagram at right can be used to prove that 


ALLA IR eA HT 
a arctan 1 + arctan 2 + arctan 3 = 7. 


Don’t see how? Here’s a hint: All three triangles are right triangles, and 
all three have an acute angle with a vertex at (0,1). What is the tangent 
of each of these acute angles? 
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Sidenote: Loosely speaking, sound i is s caused by vibrations that are sinusoidal in ‘nature; | 

A these cause what are called sound waves. These waves have an amplitude (which | 

is related to the volume of the sound) and a frequency. The frequency of a sound 

is related to the pitch of the sound, which is how high or low the sound is. As we | 

noted on page 61, the frequency of a sinusoidal function of time is the reciprocal | 

of the function’s period. So, the period of a sound wave is related to the pitch of | 
the sound. 

If you've ever tuned a guitar by striking two strings and listening to how the 
sounds they make interact, you've heard the sum-to-product rule in action. If the 
guitar is in tune, then you hear a nice, even sound. But if you have the guitar very 
nearly in tune, but not quite, then the sound oscillates. If the guitar is further out | 
of tune, then the sound oscillates a little more. Let’s see if we can explain what’s | 
going here. 

Each string produces a sound wave that is of the form f(t) = sinat for some 
constant a. If the guitar is perfectly in tune, then the two waves don’t interfere at 
all, and you hear a nice, smooth sound. We can get an example of what happens 
here by looking at the graph of y = sin 30x + sin 30x: 


AUA tN uy 
ia 


Figure 3.6: Graph of y = sin 30x + sin 30x | 


That’s a nice, smooth, even sound—the amplitude of the wave is constant 
throughout. But what happens if they are a little off? Below is the graph of 
y = sin31x + sin 29x: 


My yl, Hl, 
mn ne” TMG ‘Waa i nit HART "z | 


Figure 3.7: Graph of y = sin 31x + sin 29x 


| Here, we can see what you hear when the strings are just a little bit out of 
tune. Instead of a nice, even oscillation, there is an overall up and down to 
the intensity of the sound. When the strings are a little more out of tune, this- 
up and down happens more quickly, so the sound oscillates faster. Let’s graph 
y = sin32x + sin 28x to get a picture of ve s happening: | 


il i atl lhl le lal Aa | 


| 
| nen ih TT unin it eT RE TT TE 


Figure 3.8: Graph of y = sin32x + sin 28x 


See if you can use the sum-to-product identity to figure out what's going on here. 
Answer on page 127 | 
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3.5x Problem Solving with Identities 


We’ve solved a number of challenging problems with trig identities. In each case, the section in which the problem 
appeared gave us a very convenient hint which identity to try. In this section, we tackle problems without this 
helping hand, and in doing so, learn some strategies for determining which identity will be helpful on a given 
problem. 


There are no guiding parts on the problems in this section, but we have provided hints that you can look up 
in the back of the book if you get stuck. 


HEE 
| Problems Ba 


Problem 3.26: Determine A + B if A and B are acute angles such that 


sin Á + sin B = yi and cos Á — cos B = 


Problem 3.27: Find the sum of the roots of tan? x — 9 tan x + 1 = 0 that are between x = 0 and x = 27 radians. 
(Source: AHSME) Hints: 137, 225 


(Source: COMC) Hints: 84 


Problem 3.28: Let a = 7/2008. Find the smallest positive integer n such that 
2[cosa sina + cos 4a sin 2a + cos 9asin 3a + --- + cos(n7a) sin(na)| 


is an integer. (Source: AIME) Hints: 189, 195 


Problem 3.29: Let a, b, c, d be positive real numbers such that 


err, 
Chae, 


1 
ac— bd = =. 


Calculate ad + bc. (Source: Mandelbrot) Hints: 79 


Problem 3.30: Evaluate 
cos 1° + cos 2° + cos 3° + --- + cos 43° + cos 44° 


sin 1° + sin 2° + sin 3° +- -- + sin 43° + sin 44° ` 
(Source: AIME) Hints: 103 


Problem 3.31: Given any seven real numbers, prove that there are two of them, x and y, such that 


(Source: Canada) Hints: 212 


3.5x. PROBLEM SOLVING WITH IDENTITIES 


Problem 3.26: Determine A + B if A and B are acute angles such that 


: : 3 
sin A + sinB = C and cos Á — cos B = iB 


(Source: COMC) 


Solution for Problem 3.26: Our first instinct might be to try sum-to-product and difference-to-product identities 
since we have a sum of sines and a difference of cosines. Applying these identities to each equation gives 


asin (442) (ae 
De? oe 


-2sin (245) sin (45*) = L 
2 2 VX 


Uh-oh. It’s not so clear what to do with that. We want A + B, but dividing the first equation by the second 
eliminates the terms involving A + B. It’s not at all clear how we can get rid of the terms involving A — B. Let’s 
look around for something different before trying to force this to work. 


Concept: Don’t get locked into one way of attacking a problem—look for several starting 
O= points, go a little ways down each path, and then choose the one that looks most 
fruitful. l 


We expect that we'll have to combine the equations in some way, but it’s not obvious that adding or multiplying 
them will do any good. We need some way to conveniently combine the sines and cosines, which brings us back 
to sin? 0 + cos? @ = 1. The square roots on the right sides of the original equations are another reason we might 
think of this identity, since squaring each equation will get rid of the square roots. We’re not sure what we'll do 
with the other terms that squaring produces, but maybe it will all work out in the end. 


Squaring each equation gives: 
sin’ A + 2sin Asin B + sin’ B = 


cos? A — 2 cos A cos B + cos? B = 


NI= NIW 


Adding these allows us to use sin? A + cos? A = sin? B + cos? B = 1, and we have 
2 +2sinAsinB —2cosAcosB = 2, 


so sin A sin B — cos A cos B = 0. This is promising, because we're looking for A + B, and the left side looks a lot like 
the identity for cos(A + B). Writing out that identity, we see that 


cos(A + B) = cos A cos B — sin A sin B = -(sin A sin B — cos A cos B), 


and we just showed that the final expression on the right equals 0. Therefore, we have cos(A + B) = 0. Since A and 
B are acute, we must have 0° < A+B < 180°, so A + B = 90°. 0 


Taking a couple different first steps helped us avoid getting stuck. We weren't sure that either one of these 
would solve the problem, but they both initially looked promising. So, we took them both, and found that one of 
them gave us a result that we could use to solve the problem. (Our “failed” start will work, too, but it’s quite a bit 
more work—see if you can figure out how.) 


a aaaaaaaaaŘŘĖĖŮĖĖŮa 
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Concept: Experiment! Don’t wait until you see the whole solution before you try 
Q= something—you might end up waiting forever. Take little steps that might be 
; mas 2 TPE MGI ee LE Tea : 


This problem was also a good example of a very common strategy in dealing with trig identities: 


Important: When stuck, try using sin? 0 + cos? = 1. 


Ve 


Problem 3.27: Find the sum of the roots of tan? x — 9 tan x + 1 = 0 that are between x = 0 and x = 27 radians. 
(Source: AHSME) 


Solution for Problem 3.27: Solution 1: Sines and cosines. 


Concept: ~ When in doubt in a a trig problem involving functions other than sine and cosine, 
@ try converting everything to sine e and cosine. 


Writing the equation in terms of sine and cosine gives 


sinx 9sinx 
e +1=0. 
cos?x cosx 


Multiplying both sides by cos? x gets rid of the fractions, and gives 
sin’ x — 9 sin x cos x + cos*x = 0. 


Aha! There’s that sin? x + cos” x again. Since this equals 1, we have 1 —9 sin x cos x = 0, which means sin x cos x = §. 
We recognize the left side as $3% , so our equation now is sin 2x = 2. What’s wrong with this finish: 


Bogus Solution: “From sin 2x = we chave 2x = = arcsin 3, so the only solution to the original 


2 
pi equation is isx= 2 arcsin § 15: 


The problem here is that arcsin å is not the only value of 2x such that sin2x = 3. What is wrong with this 


solution: 
Bogus Solution: Angles whose sine is 3 must have terminal points with y-coordinate ag 
pig There are two such points on the unit circle. One of these is the terminal 
point of arcsin 3 and the other is the terminal point of 7 — arcsin 3, since 


sin(m — 8) = sin 0. Therefore, in addition to 2x = arcsin , which gives us 
x = 4 arcsin 3, we can also have 2x = n — arcsin 2, which gives the solution. 


es 5 ~ 3 arcsin 5, The sum of these two solutions is Be 


Alas, this is another convincing-but-wrong solution. It js true that there are only two angles from 0 to 2x whose 
sines equal 2, However, we are told that the sine of 2x is = Since 0 < x < 27, we have 0 < 2x < 47. Therefore, in 
addition to the two values of 2x we found in the second Bogus Solution, we also have two more between 27 and 
4r. We get these values of 2x simply by adding 27 to the values we already found, since sin(@ + 27) = sin 0 for all 
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@. So, these extra we values of 2x are 2x = 27 + arcsin § 2 and 2x = 3n — ous +, which give x = n + 5 arcsin 3 and 
x=- 5 arcsin 2 5, respectively. The sum of these two cre for x is °=, and adding this to the stim of the first 


two sation: we found gives a total of 9% + Z = 37. 


Solution 2: It’ : a quadratic in disguise! The given equation is quadratic in tanx. To see this, we let z = tanx, 
which gives us z? — 9z + 1 = 0. The quadratic formula then gives 


9+ 81-4 9+ V77 
2 as 
Good luck finding the angles whose tangents equal those numbers without a calculator. We can at least tell that 


tan x is positive, which means that there are first and third quadrant angles that are solutions. But it doesn’t 
appear to tell us much more. Maybe this is a dead end. 


Line = 2S 


Looking back at the quadratic, we notice that the coefficients of the quadratic term and the constant term are 
both 1. This means that the product of the roots is 1. To see why, suppose the roots are r and s. Then, we can 
factor the quadratic as (z — r)(z — s). Expanding this product gives z? — (r + s)z + rs. If this equals z? — 9z + 1, then 
we must have rs = 1. 


But how does this help? All it tells us is that the roots of the equation are reciprocals. In other words, if tan 0 
is one solution, then 1/ tan @ is the other solution. But 1/ tan @ is cot 0, so now we know that we can write the two 
solutions to z? — 9z + 1 = 0 as tan @ and cot @ for some value of 8. This doesn’t tell us what @ is, but we don’t need 
to know @. We only need the sum of the possible values of 0. 


Since cot 0 = tan (z — a), we can now write the roots of z? — 9z + 1 = 0 as tan 0 and tan (3 — o) for some value 
of 0. Success! We only want the sum of the values of x such that tan? x — 9 tan x + 1 = 0. We just found that if 
x = 0 is a solution, then so is x = + — 0. Therefore, the two first quadrant solutions have sum >. Moreover, since 
tan(7 + 0) = tan 0, we simply add nx to 6 and to + — 0 to Se a two third quadrant solutions, 0 + n and  — 6. 


The sum of these is ar, so the sum of all four A is F d U c G tal 


Problem 3.28: Leta = 7/2008. Find the smallest positive integer n such that 


2[cosasina + cos 4a sin 2a + cos 9a sin 3a + -- - + cos(n?a) sin(na)] 


is an integer. (Source: AIME) 


Solution for Problem 3.28: We don’t see a good way to combine terms in the sum, so we turn to our product-to-sum 
identity for cos @ sin B, since that looks like the only tool we can use to get a different look at the problem. Applying 
this identity to first few terms gives 
1 
cosasina = ; (sin(a + a) + sin(a — a)) = 5 (sin 2a + sin0), 
1 
cos 4a sin 2a = ; (sin(2a + 4a) + sin(2a — 4a)) = 5 (sin 6a + sin(—2a)), 
1 sae 
cos 9a sin 3a = 5 (sin(3a + 9a) + sin(3a — 9a)) = 5 (sin 12a + sin(—6a)), 
ieee i 
cos 16a sin 4a = 5 (sin(4a + 16a) + sin(4a — 16a)) = 5 (sin 20a + sin(—12a)) 


At first, adding these doesn’t look promising. If only those negatives inside the sines in the last terms on the right 
were outside the sines instead—then we'd have a bunch of cancellation when we added. 


Wishful thinking is a powerful problem solving tool—thinking about what you’d | 
like to be true can focus your attention on the key step in a solution. J 


Concept: 


O= 
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We don’t have to just wish that sin(—a) = — sina; this is an identity we already proved! So, now we can write 
the products in our sum as 


cosa sina = : (sin 2a — sin 0), 
cos 4a sin 2a = ; (sin 6a — sin 2a), 
cos 9a sin 3a = ; (sin 12a — sin 6a), 

cos 16a sin 4a = ; (sin 20a — sin 12a), 


Adding all these will cancel out everything except 4 sin 20a and —} sin0. But will this cancellation continue? Will 
we always be able to cancel part of the k® term with the term before it? We check by looking at what happens 
when we apply the product-to-sum identity to the (k — 1)" and k** terms: 


cos(k — 1)°a sin(k — 1)a = : (sin(k —1+(k—-1)*)a + sin(k -1 - (k — 1)?)a) 
= ; (sin(? — k)a + sin(-F? + 3k - 2)a) 


e > (sin k(k — 1)a — sin(k — (ke —2)a), 


` 


coska sin ka = ; (sink +)a + sin(k — ka) = ; (sin(k + 1)ka — sin k(k — 1)a). 


Sure enough, we have the expected cancellation when we add these terms, so we see that when we add the 
entire original series, the only terms that will not cancel are the —} sin0 from cosa sina and $ sin(n + 1)na from 
cos nĉa sina. Therefore, we have 


1 1 
2[cosa sina + cos 4a sin 2a + cos 9a sin 3a + +--+ + cos(n?a) sin(na)] = 2 (5 sin(n + 1)na — 5 sin 0) = sin(n + 1)na. 


So, when a = 7/2008, the original summation equals sin tran, We seek the smallest positive value of n that 
makes this expression an integer. The sine of an angle is an integer if and only if the angle is an integer multiple 
of Z. Therefore, “HWT = ka for some integer k. Solving for k gives k = #2, so 1004 must evenly divide (n + 1)n. 
The prime factorization of 1004 is 2? - 251, so 251 must evenly divide n + 1 or n. The smallest positive candidate 
for n is 250, but then 2? doesn’t divide (n + 1)n. The next smallest candidate, n = 251, works, since 2? divides 252. 


So, the smallest positive integer n that satisfies the problem is n = 251. O 


A series in which part of each term cancels with part of another term in a way that allows us to write the series 
much more simply is called a telescoping series. A key step in our solution above is realizing that the sum could 
be written in a form that would “telescope” and leave us a much simpler expression. 


Concept: Manyseriesinvolving trigonometric functions can be written as telescoping series. 


(Problem 3.29: Let a, b, c, d be positive real numbers such that a? + b? = 1, c? + d* = 1, and ac — bd = 1/2. 
| Calculate ad + bc. (Source: Mandelbrot) 


Solution for Problem 3.29: The two equations a? + b? = 1 and c* + d* = 1 are in the same form as the identity 
sin? 0 + cos? @ = 1. Since a, b, c, and d are positive real numbers that satisfy these two equations, we can make 
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the substitutions cit 7 = cos x, C = sin y, and d = cos y, where x and y are first quadrant angles. With this 
substitution, we are guaranteed to have a? + b? = c? + d? = 1. Let’s see what this substitution gives us in the third 
equation: 


(sin x)(sin y) — (cos x)(cos y) = 3 
We now recognize a trig identity in the third equation as well; we can write the left side as 
(sin x)(sin y) — (cos x)(cos y) = —((cos x)(cos y) — (sin x)(sin y)) = — cos(x + y), 


so the third equation now is — cos(x + y) = , which gives us cos(x + y) = —}. Since x and y are first quadrant 
angles, we know that x + y is in the first or second quadrant. The only such angle for which cos(x + y) = -} is 120°. 


But how does this help with ad + bc? Let’s see what our substitution does to this expression: 
ad + bc = (sin x)(cos y) + (cos x)(sin y). 


Another trig identity! We have 
(sin x)(cos y) + (cos x)(sin y) = sin(x + y). 


Since x + y = 120°, we have ad + be = sin 120° = = o 


Concept: Trigonometry can be a powerful tool even in problems that don’t appear to be 
Q= about trigonometry. 
A key sign that a trigonometric substitution might be helpful is the presence of an expression or equation that 
is conspicuously in a form found in a common trig identity. 


| Problem 3.30: Evaluate 
| cos 1° + cos2° + cos3° + --- + cos 43° + cos 44° 


sin 1° + sin2° + sin 3° +--- + sin 43° + sin 44° ` 


| (Source: AIME) 


Solution for Problem 3.30: The sum of trig functions suggests the sum-to-product identities, but there are 44 cosines 
in the numerator—to which pairs should we apply the identities? 


| Concept: Look for convenient relationships among the angles in trig problems. = | 


The angles can be paired off into sums that equal 45°, which is an angle we know how to handle. Applying 
the cosine sum-to-product identity to these pairs gives 


°41° 44° -1° 45° we 

cos 44° + cos 1° = 2 cos (= as ) cos ( 5 ) = 2cos > cos -y s 
a P 43° + 2° 43° =) £ 45° 41° 

cos 43° + cos 2 = 2cos ( 5 ) cos ( 7 = 2cos 7 cos z” 
z 3 42° + 3° 42° =) i 45° 39° 

cos 42° + cos3 = 2c0s ( 5 ) cos ( 5 = 2 cos 7 cos 7? 


° 4 22° 23° — 22° 45° Ne 
cos 23° + cos 22° = 2 cos (Pa | cos (==) = 2 cos > cos z 


a 
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And when we add all these up, we get... quite a mess. We can handle cos 45° with the half angle identity, but the 
rest of those cosines are intimidating. 


= When you encounter a nasty complication in a trig problem, and can’t find an 
identity to wipe it out, look for something else in the problem to do the dirty work 
for you. 


Concept: 


Q= 


We could try pairing the cosines off again, but that will generate even more angles we don’t know how to 
handle. (Try it and see.) Let’s at least take a look at what happens with the sines in the denominator when we 
apply the sum-to-product identities. Maybe it will all work out in the end: 


sin44? + sin 1° = 2sin ( iii ) cos (= mal ) en con p 
2 2 2 
oO o 4 2 58 o o 
sin 43° + sin 2° = 2 sin (= si ) cos ( 2 ) =2sin > cos $- P 
[o] fe} oe [a] 4 ie} o 
sin 42° + sin 3° = 2 sin (= a ) cos (= 2 ) =e pies 7 
2 2 2 
[e] [e] o aa fe} is} Te 
sin 23° + sin 22° = 2sin (2) cos (2) P a 
2 2 2 D 


There are those cosines again, but this time we’re happy to see them! We can use these to cancel out the ones 
we found when we paired off the cosines. We have 


cos 1° + cos 2° +--+ + cos 44° 2cos (cos $° + cos $° +--- +.cos 3") : NE 
- - - = E oe AO a 
an i sin e sine Psin (cos $ + cos # +++++c08 }") 2 
Now, we know what to do. We apply the half angle identity for tangent to find 
a a 1 T EE a 
2 tan  (sin45°)/(1+cos45°) sin45e y2 T 


Problem 3.31: Given any seven real numbers, prove that there are two of them, x and y, such that 


0< 


(Source: Canada) 


Solution for Problem 3.31: There don’t appear to be any good algebraic tools for attacking this problem. Even 
multiplying both sides of the right side inequality by 1 + xy is complicated by the fact that 1+ xy might be negative. 
However, the expression (x — y)/(1 + xy) does resemble the angle difference identity for tangent: 


tan A — tan B 


tan(A — B) = ————————. 
a ) 1 + tan A tan B 


Another clue that we can use this identity is that x and y can be “any real numbers.” The range of tan 0 is all real 
numbers, so letting x = tan A and y = tan B is a permissible substitution, since no matter what x and y are, there 
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are angles A and B such that x = tan A and y = tanB. Writing the given inequality with this substitution, we have 


tan A — tan B a 
~ I+tanAtanB™ 3 


Applying the angle difference identity for tangent, we have 


1 
0 < tan(A — B) < —. 
Sr 
What does this tell us about A and B? The algebraic expression is not the only part of the problem we can 
relate to trig. Viewing the two constants on the ends of the inequality as tangents of angles as well, we have 


tan0 < tan(A — B) < tan = 


Now we have a different view of the problem. We can view each of the seven real numbers as the tangent of 
some angle from —5 to $, and our problem becomes: 


Given any seven angles whose measures are in the interval (- To z), prove that there are two of them, 
A and B, such that 
T 
tan0 < tan(A — B) < tan g 


We now consider what angles have tangents between tan 0 and tan Z. We have tan0 < tan(A—B) < tan Z when 
the slope of the line through the origin and the terminal point of A — B is positive but no greater than the slope of 
the line through the origin and the terminal point of 4. The only first quadrant angles for which this is true are 


0 < A-B < $. (It is also true for some third quadrant angles, but because A and B are in the interval (-3, z), the 


angle A — B cannot be in the third quadrant.) 


All we have to do now is show that for any seven angles in the interval (-3, z), there are two that are no more 


than 7 apart. To do this, we note that we can break the interval (-3, 2) into 6 non-overlapping intervals that are 


no longer than ¢: 
Cai bagl Cet Cse Gal G2 


If any two angles are in the same interval, then they are no more than % apart, and we have the two desired 
angles. If we restrict ourselves to one angle per interval, the most we can have is 6 angles. Since we have 7 angles 
distributed among these 6 intervals, there must be an interval with at least 2 angles. Letting A be the larger of 
these two angles and B be the smaller, we have 0 < A — B < 7, so tan0 < tan(A — B) < tan %, as desired. O 


Sidenote: A key step in our solution to Problem 3.31 is the observation that if 7 angles are 
distributed among 6 intervals, then at least 2 of the angles must be in the same 
interval. This is an example of the Pigeonhole Principle, which states that if more 
than n objects are distributed among n holes, then at least one hole must contain 
more than one of the objects. We discuss the Pigeonhole Principle in more detail 
in Art of Problem Solving’s Intermediate Counting & Probability. 


Problem 3.32: Evaluate (sin 1°)(sin3°)(sin 5°) -- - (sin 177°)(sin 179°). 
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Solution for Problem 3.32: We can pair off terms whose angles sum to 180°. We might try the product-to-sum 
identity (give it a try), but seeing the 180° suggests that we can also use sin(180° — 0) = sin @ to give 


(sin 1°)(sin 3°)(sin 5°) - -- (sin 177°)(sin 179°) = (sin 1°)(sin 3°) - -- (sin 89°)(sin 89°)(sin 87°) - -- (sin 3°)(sin 1°). 


We can now pair up the sines as squares of sines. This suggests trying sin? 9 = 1 — cos? 0, but that turns our 
product into a product of differences, which looks scary. If only those products of two sines of the same angle 
were the product of the sine and the cosine of the same angle, then we could use the sine double angle identity in 
reverse to simplify the problem. 


This bit of wishful thinking pays off when we recognize that we can again pair off the angles in a convenient 
way, since we can pair them such that the angles sum to 90°. We want to turn some sines into cosines and we have 
angles that add to 90°, so the identity sin 0 = cos(90° — 0) might help us out. Applying this to the second batch of 
sines in the product gives: 


(sin 1°)(sin 3°) -- - (sin 89°)(sin 89°)(sin 87°) -- - (sin 3°)(sin 1°) 
= (sin 1°)(sin 3°) - -- (sin 89°)(cos 1°)(cos 3°) - - - (cos 87°)(cos 89°). 
Notice that we could have gotten straight to this from this initial product by applying sin 0 = cos(0 — 90°). 


Concept: There are many ways to tackle most trig identity problems. You won't always 


find the fastest way, and reviewing your solutions may reveal relationships that 
will help you solve future problems. 


We can now apply the sine double angle identity in reverse: 


(sin 1°)(sin 3°) - - - (sin 89°)(cos 1°)(cos 3°) - -- (cos 89°) = (sin 1°)(cos 1°)(sin 3°)(cos 3°) - - - (sin 89°)(cos 89°) 


(eres, 


= in 2°)(sin 6°) --- (sin 178°). 


Once again, we have angles that pair to 180°, so we try sin 0 = sin(180° — 0). But we have to be careful: 


WARNING! Watch out for special cases. When pairing off terms in a sequence, a sum- 


mation, or a product, you have to pay special attention to the middle term if 
there are an odd number of terms. 


Here, we have 45 terms, so we can’t pair them all off. There’s one extra term—the middle term, which is 
sin 90°. But this simply equals 1, so we can continue without it, since it doesn’t affect the product. After applying 
the identity sin(180° — 0) = sin @ to all terms with angles greater than 90°, we have 


1 


was (sin 2°)(sin 6°) --- (sin 178°) = zz (sin 2°)(sin 2°)(sin 6°)(sin 6°) --- (sin 86°)(sin 86°). 


Uh-oh. We can’t pair these angles off to add to 90° like we did before. If we apply sin 0 = cos(90° — @) to the 
second sine in each pair, we have 


x (sin 2°)(cos 88°) (sin 6°) (cos 84°) - -- (sin 86°)(cos 4°). 


Now what? Looks like we're stuck. 
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Concept: Don’t get so mired in one-step tactics that you stop thinking strategically. When 


O= hopelessly stuck, step back and think more broadly about what you're trying to 
accomplish. | 


We'd like to make the huge product 
degen RS 
255 (sin 2°)(sin 2°)(sin 6°)(sin 6°) - - - (sin 86°) (sin 86°) 


collapse in some way. We collapsed a sum in Problem 3.28 by writing each term as a difference, so that part of each 
term cancels with another. To do the same in a product, we write each term as a quotient, hoping the denominator 
of each term will cancel with the numerator of another. So, we look for ways to rewrite some of our terms as 
quotients. A natural candidate for this approach is the sine double angle formula, because we’ve used it fruitfully 
this way before. 

We write sin2° = #4, and think about whether or not doing the same for other terms will produce the 
desired cancellation. It’s not immediately obvious that the cos 2° will cancel with a numerator in another term, 
but the sin 4° makes us focus on the sin 86° term, since cos 86° = sin 4°. Indeed, we have sin 86° = ##12— and we 


~ 2cos 86°’ 
see that we'll get some cancellation. So, rewriting all the terms in our product similarly, we have: 


1 1 / sind” Voy cin sin 172° \? 
EE ene ae eno sin 86°)(ain 86° = 5 ( ) (=) (=) 
a ae on 2) in 6") (cin Gin ) 24 \2cos2° 2 cos 6° 2 cos 86° 


We then apply cos 0 = sin(90° — 0), knowing we will get some cancellation. We find: 


a ( sin 4° I ( sin 12° i) ( sin 172° y D ( sin 4° j (> e e 

2 20s 2° 2 cos 6° 2cos86°/ 245 24 \sin88° sin 84° sin 80° sin4° / ` 
We see that some of the sines immediately cancel, but some are left over. The sine of each multiple of 4° less than 
90° appears in a denominator. The numerator contains sines of every multiple of 4° less than 180° that is not a 


multiple of 8°. Cancelling the common sines leaves only sines of multiples of 8° in the denominator and the sines 
of angles greater than 90° in the numerator: 


I i ( sin 4° J (@ aa e a (a ey ie (c 92°)(sin 100°)(sin 108°) --- (sin ee) 
DE D A SMES sin 84° sin 80° sin 4° — 289 (sin 88°)(sin 80°)(sin 72°) - -- (sin 8°) 
And now, the identity sin 0 = sin(180° — 0) provides the finishing touch: 
1 = 92°)(sin 100°) (sin 108°) - - - (sin 172°) ) PL (= 92°)(sin 100°)(sin 108°) - - - (sin nea eli 
289 \ (sin 88°)(sin 80°)(sin 72°) - - - (sin 8°) ~ 289 \ (sin 92°)(sin 100°)(sin 108°) - - - (sin 172°) 289° 


The final result suggests that there might be a better way to solve this problem—there are 90 terms in the 
original product and the answer is 2 - 5%. Let’s look for a better solution. 


Concept: If you do a lot of arduous work on a problem and come to a relatively simple 
O==2 answer try to use that answer as a guide to find a simpler solution. 


So, we go back to the original product: 
(sin 1°) (sin 3°)(sin 5°) - - - (sin 177°) (sin 179°). 


We’d like to find a way to introduce 89 or 90 factors of 4. We’ve seen how to introduce factors of } with the double 


angle formula for sine, such as when we wrote sin 2° = pose . We expect that this means we'll have to introduce 
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terms with an even number of degrees, since we'll be using the sine double angle formula. A natural choice now 
is to introduce all the missing even degree measures by writing the desired expression as a quotient: 


(sin 1°)(sin 2°)(sin 3°) - -- (sin 178°)(sin 179°) 


(sin 1°)(sin 3°)(sin 5°) sane (sin 177°)(sin 179°) = (sin 2°)(sin 4°)(sin 6°) -- - (sin 176°)(sin 178°) 


This gives us 89 terms in the denominator, which suggests we’re on the right track. We need to pair up factors in 
the numerator in a way that allows us to finish with the sine double angle formula. Therefore, we need to find 
cos 1°, cos 2°,...,cos 89° to pair up with sin 1°, sin 2°,...,sin 89°. This means we have to focus on turning the sines 
of obtuse angles into cosines of acute angles. The identity sin 0 = cos(@ — 90°) does the trick, and we have 


(sin 1°)(sin 2°)(sin 3°) --- (sin 178°)(sin179°) _ (sin 1°)(sin 2°) - - - (sin 89°)(sin 90°)(cos 1°)(cos 2°) - - - (cos 89°) 
(sin 2°)(sin 4°)(sin 6°) - -- (sin 176°)(sin 178°) (sin 2°)(sin 4°)(sin 6°) - - - (sin 176°)(sin 178°) 
_ (sin 1°)(cos 1°)(sin 2°)(cos 2°) - - - (sin 89°)(cos 89°) (1) 
7 (sin 2°)(sin 4°)(sin 6°) - - - (sin 176°)(sin 178°) 
(4 sin2°) (4 sin 42°) --- (4 sin 178°) 
~ (sin 2°)(sin 4°)(sin 6°) - - - (sin 176°)(sin 178°) 
al 


= 555° 


That’s much easier, but it would have been a pretty hard solution to find without the big clue that we’re looking 
for 89 factors of }. With that clue in mind, we found a new strategy: 


Concept: Some products can be evaluated by strategically multiplying by expressions that | 
Q= equal 1 in order to introduce convenient cancellation. 


We did so in our second solution by multiplying by a bunch of factors of the form 


sin 2n oO 


sin 2n° ° 


Our solution to Problem 3.32 is a good example of how to tackle hard problems. We didn’t have a single 
magical observation that solved the problem. We often took steps without knowing for sure that they’d lead to a 
solution. We used wishful thinking to help guide us, and stepped back when one of our forward steps proved to 
be a blind alley. Above all, we didn’t give up—we kept trying new approaches until we found a combination of 
tactics that solved the problem. And after we solved the problem, we used the answer, together with some other 
clues in our solution, to go back and find a nicer solution. 


[Exercises BES 
3.5.1 Find A such that 0° < A < 90° and cos 41° + sin 41° = V2sin A. (Source: NYSML) 


3.5.2 Find a solution for Problem 3.29 that doesn’t require trigonometry. 


3.5.3 Determine the sum of the angles A and B, where 0° < A, B < 180°, and 


sin A + sin B = o cos A + cos B = iG 


(Source: COMC) (Note: This is not exactly the same as Problem 3.26.) 
3.5.4 Iftanx+tany = 25 and cotx + cot y = 30, then what is tan(x + y)? (Source: AIME) 


3.5.5 Let A and B be acute angles such that tan A = 1/7 and sinB = 1/ V10. Find the degree measure of A + 2B 
without using a calculator. 
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3.5.6 Find all acute angles 0 such that sin 0 + sin 20 = cos 8 + cos 20. 


3.5.7 Given cos(a + f) + sin(a — $) = 0 and tan $ = 1/2000, find tana. (Source: HMMT) 


3.5.8x Given that 


(1 + sint)(1 + cost) = >, 


find (1 — sin f)(1 — cost). (Source: AIME) Hints: 67, 44 
3.5.9x Solve the system 


2x + yx? =y, 
2y +z% =z, 
Qe xz =x, 
Hints: 240 
3.6 Summary 


In this chapter, we explored the following identities: 


sin? 0 + cos? 0 =1 sin(—0) = -sin 0 
cos(—9) = cos 0 


tan(—0) = — tan 0 


tan? 0 + 1 = sec? 0 
co? 0 +1 = cs 0 


Angle sum and difference identities: 


sin(a + f) = sin a cos f + sin f cosa 
cos(a + f) = cos & cos f — sina sin f 
tana + tan f 


Ooo- 1- tang tan f 


Double angle and half-angle identities: 


sin 2x = 2 sin x cos x 


cos 2x = cos? x — sin? x = 2 cos? x — 1 = 1 — 2 sin? x 


2tanx 


tan2x = = 
1 — tan? x 


Product-to-sum and sum-to-product identities: 
1 
cos & cos f = > (cos(a + p) + cos(a — f)) 
1 
sinasinf = 5 (cos(a — p) — cos(a + p)) 


sin a cos B = ; (sin(a + p) + sin(a — B)) 


sin(90° — 0) = cos 0 
cos(90° — 0) = sin 0 
tan(90° — 0) = cot 0 


sin(180° — 0) = sin@ 
cos(180° — 0) = — cos 0 
tan(180° — 0) = — tan 0 


sin(a — 8) = sin a cos f — sin f cos q 
cos(a — B) = cos æ cos f + sin a sin f 
tana — tan f 


a 1 + tana tan f 


an /1+cos@ 
2 2 

cin oe 1-— cos 0 
2 2 

ee sin 0 _ 1~cos6 
2 1+cosé sin 0 


cos x + cos y = 2cos (=) Cac (== 
cosx — cos y = —2sin (=) an (=) 
E x-y 


> : ae es 
sinx + sin y = 2sin ( 


5 ; . (X 
sinx — siny = 2sin ( 


S o m ns 
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Things To Watch Out For! 


Problem Solving Strategies 


WARNING! 
r™ | 


Whenever you solve an equation by performing some possibly a 
steps, you must check your solutions at the end, to make sure they satisfy the 
original equation. Two common irreversible steps are squaring an equation | 


and multiplying both sides of an equation by an expression that might equal | 


Zero. 


If a tactic makes some progress on a problem, but doesn’t entirely solve it, | 


=a look for a way to use the tactic again. 


e Don’t get locked into one way of attacking a problem—look for several 
starting points, go a little ways down each path, and then choose the a 
that looks most fruitful. | 


e Experiment! Don’t wait until you see the whole solution before you try | 
something—you might end up waiting forever. Take little steps that might | 
be promising, and reassess after each step. | 


Wishful thinking is a powerful problem solving tool—thinking about what | 
you'd like to be true can focus your attention on the key step in a solution. | 


A fruitful first step in many trig problems is rewriting the problem in terms 
of sine and cosine, because most people have more experience with sine and 
cosine than with the other trig functions. 


When solving an equation that contains trig functions, it’s often best to write 
the equation in terms of a single trig function, so that you can then try to 
solve the equation for that function. 


Recognizing the forms of common trigonometric identities can help solve 
many trigonometric problems. 


Recognizing relationships among angles in a complicated trig expression 


can help simplify the expression. | 


Many series involving trigonometric functions can be written as telescoping 
series. 


Trigonometry can be a powerful tool even in problems that don’t appear to 
be about trigonometry. 


Extra! You may have heard that prior to calculators, people sometimes used logarithm tables to perform 
inp inno Multiplication and division of large numbers. But what did they do before mathematicians 
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developed logarithms? They used trig! More specifically, they used the product-to-sum and: 
product-to-difference identities to turn products into easier-to-handle sums and differences. If 
you can’t figure out how this works, look up prosthaphaeresis on the internet. 


REVIEW PROBLEMS 


Em ~ 


3.33 Prove that tan(m — 0) = —tané@. 


3.34 Express sin(@ — 7) in terms of cos 0 and/or sin 8. 


3.35 Prove each of the identities below using only the identities sin(—x) = sin x, cos(—x) = — cos x, sin(90° — x) = 
cosx, sin? x + cos? x = 1, and sin(x + y) = sin xcos y + sin ycosx. Try to do this problem without looking back in 
the book. The goal of this problem is to learn that you do not have to memorize all of these identities—you can 
rebuild them from a small group of basic identities whenever you need them. 


(a) cos(x- y) = cosxcos y + sin xsin y (d) cosž= „198 for0<x< n 


(b) sin2x = 2sin xcosx (e)  cosxcosy= 3 (cos(x + y) + cos(x — y)) 
x-Yy x+y 


sinx=siny = 2sin (5) cos (*2) 


tan x+tan y 
TET, (f) 5 


(c) tan(x+y)= 1-tanxtany 
3.36 Simplify sin(x — y) cos y + cos(x — y) sin y. (Source: AHSME) 


3.37 Which of the following equals V2 cos (3 = x) ? 


(a) cosx+sinx (c) cosx- sinx 
(b) -cosx +sinx (d) -cosx- sinx 
-3 + 20 
3.38 Determine the constants 4 and b so that Stor” =a+bsin@. (Source: CEMC) 


3.39 What is the range of the function g(t) = 2sint + 3 cos £? 
tan? 20° = sin? 20° 
tan? 20° sin? 20° 
tan(a—fB)+tanp _ 
1-—tan(a—f)tanp — 


3.40 Compute . (Source: HMMT) 


3.41 Show that tana. 


3.42 Express tan (% + o) in terms of tan 0. 
3.43 Show that there is no triangle ABC for which cos A cos B — sin Asin B = 1. 


3.44 The complicated trigonometric expression shown below is equivalent to either sin x, cos x, or tan x. Which 


one is it? 
¥3 sin(x + 30°) — cos(x + 30°) 


4 cos xsin(x + 30°) — 4 sin x cos(x + 30°) 
(Source: Mandelbrot) 
3.45 Evaluate tan (arccos (-1) + arcsin 2). 
3.46 Compute (sin 15° + sin 75°)®. (Source: ARML) 


sin2x  cos2x 
sin x cosx ` 


3.47 Simplify 


3.48 Find all possible values of sin 26 if cos 0 = —0.6. 
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cotx— 1 £ cos 2x 
cotx+1 1+sin2x’ 


3.50 Ifsin2x = 24/25, find the value of sinf x + cost x. 


3.49 Show that 


3.51 Find tan sr, 
3.52 Find all x such that 0 < x < 27 and sin2x < sinx. 


3.53 For -n < 8 < 7, find all solutions to the equation 2(sin? 0 — cos? 0) = 8 sin 0 — 5. (Source: CEMC) 


sin 0 + sin 20 
how that ————————- =t if 0 < p 
(a) Show tha o A andif0<@<n7/ 


(b) If 7/2 < 0 < 7, for what value(s) of 0 is the identity in part (a) not true? (Source: CEMC) 
3.55 Find the acute angle @ such that 2 sin 52° sin 68° = 5 + sing. 


; a tan x cot x 
3.56 Prove the identity Tage 1 ee Caw il. 
= co — tan 


, ok eee 
3.57 Show that sg UNSERE = tan 3x for all x for which tan 3x is defined. 
cos x + cos 3x + cos 5x 


Challenge Problems ff 


3.58 Find the value of tanx if sin x + cos x = 1/5 and 7/2 < x < 7. Hints: 44 


3.59 If f (4) = 1 for all x #0, 1, and 0 < 0 < 77/2, then find f(sec* 0). Hints: 156 


7% 


He : cosx — sinx 
3.60 Compute the smallest positive angle x, in degrees, such that tan4x = ————— 


Hints: 261 


——. (Source: NYSML) 
cosx + sinx 


3.61 Determine all points at which the graphs of y = 8cosx + 5tanx and y = 5secx + 2sin 2x intersect, where 
0<x<2n. (Source: CEMC) 


3.62 Find all x between —7/2 and 7/2 such that 1 — sinf x — cos? x = 1/16. (Source: HMMT) 


sin x + sin(x + y) 


3.63 Let y be a fixed real number, 0 < y < 7, and let f(x) = for 0 < x < n — y. Show that f is 


cos x — cos(x + y) 
constant. Hints: 166 
3.64 Suppose that secx + tan x = 22/7. Find cscx + cotx. (Source: AIME) Hints: 147 


L rS A 
: l dy Wie lee == Boe BS : 
3.65 Prove that for all real x and y, we have 2 Goa 2 Hints: 253 


3.66 Determine all O such that0 < 8 < A and sin? @ + cos? @ = 1. Hints: 122 


3.67 Find, with proof, all real numbers x between 0 and 27 such that tan 7x — sin 6x = cos 4x — cot7x. (Source: 
USAMTS) Hints: 267 


3.68 Find the value of 10 cot(cot"! 3 + cot! 7 + cot! 13 + cot™! 21). (Source: AIME) Hints: 26 


3.69 Find cos z - COS z - COS t, Hints: 223 
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(a) Express tan 3x in terms of tan x. 


(b) Let @ be the angle in the second quadrant for which cos@ = —8/17. Determine tan(30/2). (Source: 
Mandelbrot) 


(c) Solve for x: tan 20° tan 40° tan 80° = tan x. Hints: 72, 20 


3.71 Ifsin@ + cos@ + tan@ + cot @ + sec 0 + csc 0 = 7, then find sin20. (Source: NYSML) Hints: 99, 168 


3.72x The first two terms of a sequence are 41 = 1 and a2 = Fe Forn > 1, 


_ ln t+ an+ 
Le = aa 
1 = anān+1 
What is |a@2009|? (Source: AMC) Hints: 208 
3.73x Find the least positive integer n such that 
1 1 1 1 


-c t a H e = ĪÃ—IMMMI 
sin 45° sin46° sin47° sin 48° mee sin 133° sin 134° sinn? 
(Source: AIME) Hints: 108, 86 


3.74% Find the sum of the values of x such that cos? 3x + cos? 5x = 8 cos? 4x cos? x, where x is measured in degrees 
and 100 < x < 200. (Source: AIME) Hints: 178 


3. (hes An angle x is chosen at random from the interval 0° < x < 90°. Find the probability that the numbers 
sin’ x, cos? x, and sin x cos x are not the lengths of the sides of a triangle. (Source: AIME) 


25-4 SS Hints: 153 


3.76* Show that if x + y + z = xyz, then -—5 y tae eae jo 


3.77x The sequence {4,} satisfies a9 = 0 and an41 = Ban + E 4” — a2 for n > 0. Find ayo. (Source: AIME) 
Hints: 145, 216 


n 
1 
3.78x Prove that ` arctan —~ = arctan 
k=1 


n 
2k2 n+1- 


Sidenote: On page 111, we asked what causes the oscillations we hear when tuning an 
A instrument that is almost in tune, but not quite. To investigate, we graphed- 

y = sin 30x + sin 30x, y = sin 31x + sin 29x, and y = sin 32x + sin 28x. Applying the 
sum-to-product identity gives us a clearer picture of what's going onin the graphs. 
While sin 30x + sin 30x is just 2sin30x, applying the sum-to-product identity for 
sine gives : 


sin 31x + sin29x = 2 sin 30x cos x, 
sin 32x + sin 28x = 2 sin 30x cos 2x. 


In each case, the 2sin30x of the in-tune guitar is damped by an extra term that 
oscillates. Since cos 2x oscillates twice as fast (has half the period) as cosx, the | 
sound oscillates much faster for the sin32x + sin28x guitar than it does for the 
sin 31x + sin 29x guitar. 
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CHAPTER 4. APPLICATIONS TO GEOMETRY 


Geometry is the science of correct reasoning on incorrect figures. - George Polya 


CHAPTER 


a to Geometry 


We explore many applications of trigonometric functions to geometry in this chapter. Throughout the chapter, 
you will encounter trig functions of angles that you cannot evaluate in your head or with pencil and paper, such 
as sin 14°. When you encounter such expressions in this chapter, you can use a calculator to approximate them. 


4.1 Right Triangle Trigonometry 


Back on page 35, we proved the following: 


Important: If triangle XYZ is a right triangle with ¿XYZ = 90° and X 
ZX = 0, then 


eee 2 le _ 
sin @ = XZ’ cos 0 = zz, tan 0 = xy’ 
Another way of viewing this is: If @ is the measure of y a Z 
an acute angle in a right triangle, then we have 
ae opposite ce gene adjacent leg! TE OPRIE leg 
| hypotenuse hypotenuse adjacent leg 


= se 


Some people remember these relationships with the mnemonic “SOHCAHTOA” (pronounced “so-cah-toe- 
ah”): 
Sine: Opposite/Hypotenuse, Cosine: Adjacent/Hypotenuse, Tangent: Opposite/Adjacent. 


In this section, we apply these relationships to a variety of problems. 
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4.1. RIGHT TRIANGLE TRIGONOMETRY 


Important: Throughout this chapter we will use some very common conventions when 
working with angles of a triangle and trigonometric functions of those angles. 

Specifically, when referring to the angles of AABC, we will often refer angles of 
the triangle solely by their vertices. That is, we will write ZA, ZB, and ZC rather | 
than using three points to identify each angle. When writing trigonometric | 
functions, we will often omit the Z symbol and simply write sin A, which is the 
same as sin BAC. When referring to the side lengths of AABC, we often let 
a= BC, b = AC, and c = AB. 


_=e_ > 


Problem 4.1: 
(a) In AABC, we have ZABC = 90°, AC = 6, and ZBAC = 24°. Find BC and AB. 


(b) In APQR, we have PQ = 7, QR = 24, and ZPQR = 90°. Find ZQPR. 


T sn ON TT E i ar 


Problem 4.2: The angle of elevation is the angle above the horizontal at which a viewer must look to see an 
object that is higher than the viewer. Similarly, the angle of depression is the angle below the horizontal at 
which a viewer must look to see an object that is below the viewer. 


Object Horizontal 
Angle of Depression 


Viewer 


Angle of Elevation 


eee Horizontal Object 

(a) A surveyor measures the angle of elevation from her feet to the top of a building as 5°. The surveyor 
knows that the building is 500 feet tall. Assuming the ground is flat and level between the surveyor and 
the building, how far away is the surveyor from the building? 

(b) A bee is ona hill looking at a building. The building is 400 feet tall. The angle of elevation from the bee 
to the top of the building is 4° and the angle of depression from the bee to the bottom of the building is 
2°. What is the shortest distance the bee will have to fly to reach the building? (Answer to the nearest 
hundred feet.) 


Problem 4.3: Suppose the slope of a line graphed in the Cartesian plane is m, where m # 0. Show that m equals 
the tangent of one of the angles that the line makes with the x-axis. 


Problem 4.4: In right triangle ABC, ZABC = 90° and AC = 4. If P is on AC such that BP = 1 and BP L AC, then 
find all possible values of ZBCP. (Source: CEMC) 


Problem 4.5: In AABC, let a = BC and b = AC, and let [ABC] be the area of AABC. Show that [ABC] = Sabsin O 


Problem 4.6: What is the largest possible area of a triangle that has two consecutive sides of length 10? Prove 
that no larger area is possible. 


Problem 4.7: ABCD is a square and M and N are the midpoints of BC and CD, respectively. What is sin ZMAN? 
(Source: AHSME) 
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Problem 4.1: = 
(a) In AABC, we have ZABC = 90°, AC = 6, and ZBAC = 24°. Find BC and AB. 


(b) In APOR, we havé PQ = 7, OR = 24, and POR = 90°. Find ZQPR. 


Solution for Problem 4.1: 
(a) We have C 
6 
AC A B 
ZBAC = aÈ 
cos = JC 
Using a calculator to compute sin 24° and cos 24°, we find 
BC = ACsin ZBAC = 6sin 24° = 2.44, 
AB = AC cos BAC = 6cos 24° x 5.48. 
(b) Wehave P 
QR 24 
PR = Se = Se, 
tan ZOPR PO 7 7 
Using a calculator, we find that Q 24 £ 


ZQPR = arctan = x 73.74°. 
O 


Trigonometry has a great many real-world applications. Surveyors and (perhaps more interesting to you) 
video game programmers use trigonometry frequently. Here’s an example; we'll see several more throughout 
this chapter. 


Problem 4.2: The angle of elevation is the angle above the horizontal at which a viewer must look to see an 
object that is higher than the viewer. Similarly, the angle of depression is the angle below the horizontal at 
which a viewer must look to see an object that is below the viewer. 


Object vei Horizontal 
i Angle of Depression 


Angle of Elevation 


Vi 
cae Horizontal Object 


(a) A surveyor measures the angle of elevation from her feet to the top of a building as 5°. The surveyor 
knows that the building is 500 feet tall. Assuming the ground is flat and level between the surveyor and 
the building, how far away is the surveyor from the building? 


(b) A bee is ona hill looking at a building. The building is 400 feet tall. The angle of elevation from the bee 
to the top of the building is 4° and the angle of depression from the bee to the bottom of the building is 
2°. What is the shortest distance the bee will have to fly to reach the building? (Answer to the nearest 
hundred feet.) 
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4.1. RIGHT TRIANGLE TRIGONOMETRY 


Solution for Problem 4.2: 


(a) 


(b) 


The diagram at right depicts the building, with the surveyor at point Z. We 
seek ZX, which is one leg of AXYZ, and we know the other leg of AXYZ. When 
we apply trigonometric relationships to AXYZ, we focus the side we know, the 
side we want, and the acute angle we know: 


Building 


Z 
YX X 
tan ZYZX = =. 
an ZX 
Solving for ZX gives us 
x= YX 500 
~ tan ZYZX tan5? 


Therefore, the surveyor is approximately 5715 feet from the building. 


z 5715, 


In the diagram, the bee is at point B, and the distance from the bee Y 

to the building is BT, where T is the foot of the perpendicular from 

the bee to the building. We have two right triangles, and we know 

the measures of the acute angles in those triangles, but we don’t z= 
know any of the side lengths of those triangles. In hopes of finding pg g 
useful relationships among lengths of segments in the diagram, we n aT] 

let BT = x, and try to find other lengths in the diagram in terms of 

x. We assign a variable to BT both because BT is a side of both right 
triangles, and because BT is the length we seek in the problem. 


We know that XY = 400, so we focus on the segments along XY. From ABTY, we have tan ZTBY = 1L, 


so 
YT = BT tan ZTBY = xtan4° ~ 0.070x. 


From ABTX, we have tan ZTBX = ar so 


XT = BT tan ZTBX =xtan2 = 0.035x. 


Since we know that XY = 400, we now have an equation for x. We have XY = YT + XT ~ 0.105x, so 
0.105x ~ 400, from which we find x ~ 3800. Therefore, the bee is approximately 3800 feet from the building. 


Our key step in this solution was the first one: 


Concept: Many geometry problems are solved by assigning variables to lengths or angles 


Q= _ inthe problems, and then finding other lengths or angles in terms of the variables. 
The goal then is to find an equation that you can solve. l 


In addition to the many applications of trigonometry to Euclidean geometry, there are also applications to 
analytic geometry. 


Problem 4.3: Suppose the slope of a line graphed in the Cartesian plane is m, where m # 0. Show that m 
equals the tangent of one of the angles that the line makes with the x-axis. 


a at aL a 


Extra! There is a difference between not knowing and not knowing yet. 


= Shelia Tobias 
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Solution for Problem 4.3: We'll first address a line with positive slope. In the diagram 

at right, the line makes an acute angle of @ with the x-axis as shown. (We don’t 

need the y-axis for this problem, so we haven't included it.) Point Y is on the line 

above the x-axis, and point Z is the foot of the altitude from Y to the x-axis. The 

slope of this line is positive, so we have m = X, From right triangle Y XZ, we have 
YZ 


tan = >> =m. 


Turning to a line with negative slope, we start with the diagram at left. 


4 Point P is on the line above the x-axis, and Q is the foot of the altitude from P to 
the x-axis. As we move upward along the line, we go to the left, not the right, 
so we have m = a However, in APQR, we have 

0 tana = POR —m. 
Q R x-axis QR 


Uh-oh. The problem says that the tangent of one of the angles the line makes 
with the x-axis should be equal to the slope, not the negative of the slope. Let’s take a look at the obtuse angle the 
line makes with the x-axis. Since this angle together with a makes a straight angle, we have a + @ = 180°, so 
sin(180° - a) _ sina 
cos(180° -a) -cosa 


tan 0 = tan(180° — a) = = —tana. 
Therefore, we have tan @ = —(—m) = m, and once again, the tangent of one of the angles formed by the line and 
the x-axis equals the slope. O 


A line with nonzero slope m must intersect the x- 
axis. The line and the x-axis form an angle @ that 
is to the right of the line and above the x-axis, as 


shown. Then, we have tan 0 = m. ae | 


Our solution to Problem 4.3 gives us a quick way to visualize this relationship, so you shouldn’t have to 
memorize it. 


Important: 


Problem 4.4: In right triangle ABC, ZABC = 90° and AC = 4. If P is on AC such that BP = 1 and BP L AC, 


then find all possible values of ZBCP. (Source: CEMC) 


Solution for Problem 4.4: We let the desired angle be 0, and present two solutions: 


Solution 1: Find sides in terms of 0. We have one side length in right triangle ABC and 
one in right triangle BPC. If we could find one more side in either triangle, we could 
use the trigonometric relationships in a right triangle to find 0. We focus on BC, since 
this segment is a side of both triangles. In ABPC, we have sin 0 = BF = ve so BC = sia: B C 
In AABC, we have cos @ = Be = BC, so BC = 4cos@. (Notice that in each case, we 
investigate the trig function that we can express in terms of BC and a side length that we know.) We now have 


two different expressions for BC, so these expressions must be equal: 


= 4cos 60. 


sin 
Multiplying both sides by sin 0 gives 4sin@cos@ = 1. Applying the sine double angle identity (in “reverse”) 
makes our equation 2sin20 = 1, so sin2@ = Ł, Since @ is acute, there are two possible solutions, 20 = 30° or 
26 = 150°. The former gives @ = 15° and the latter gives 0 = 75°. So, the two possible values of ZBCP are 15° and 


752 
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We can see why the possibilities for ZC are complemen- A @ 
tary in the diagrams at right. Specifically, for any triangle 
ABC that satisfies the problem, we can swap the labels A 
and C to produce another triangle that satisfies the prob- 
lem. Since ZA + ZC = 90°, we know that these two possi- 
bilities for ZC must be complementary. 5 c 5 3 


__ Solution 2: Area. All those right angles suggest that we might be able to use area to solve this problem. Since 
BP is an altitude of AABC to side AC, we have 


[ABC] = 2 


(BP)(AC) _ 
— 
where [ABC] denotes the area of AABC. We can also express the area of AABC as half the product of its legs. 
Using trigonometric relationships in AABC, we can express both legs in terms of 0. As explained above, we have 
BC =4cos @. We also have sin 0 = ab = AB so AB = 4sin @. Therefore, 


VARNES) = 8sin@cos 0. 


[ABC] = 
We now have two expressions for [ABC]. Setting these equal gives 8 sin 0 cos 0 = 2, and the rest of the solution 
proceeds as before. O 


Both solutions involve the key first step of assigning a variable and expressing lengths in terms of that variable. 
The second offers a very useful general geometric problem solving tactic: 


| Concept: Area can be a surprisingly powerful problem solving tool, even in problems that 
| Q= ~~ don't appear to be about area. : 
i me e 


Having seen one example of a relationship between trigonometric functions and area, let’s now take a look at 
a more general one. 


| Problem 4.5: In AABC, let a = BC, b = AC, and c = AB, and let [ABC] be the area of AABC. Show that 


[ABC] = SabsinC. 


Solution for Problem 4.5: Our usual formula for triangle area is “one-half base times B 

height.” We seek a way to express a height of the triangle in terms of ZC, hoping to 

convert the “one-half base times height” formula to the desired formula. So, we draw 

altitude BP, and we have sy 
(BP)(AC) _ (BP)(b) 


[ABC] = —, = 


What’s wrong with this finish: 


The altitude introduces a right triangle with acute angle ZC. We have 


= we 


Bogus Solution: 
sinC = BE = =-,so BP = asinC, and 


_(BPXb)_ 1 
[ABC] = m had sin C. 


So, for any AABC with a = BC and b = AC, we have [ABC] = 5ab Sinem 
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The problem here is that our proof does not adequately cover every possible triangle ABC. The Bogus Solution 
only covers the the case in which ZC is acute. What happens if we have ZC = 90°? Then P is at C, which presents 
some problems. Worse yet, what if ZC > 90°? We still have to address these two cases. (Our proof above does 
handle the cases in which ZA = 90° or ZA > 90°; make sure you see why.) 


When proving a statement for all possible triangles, make sure your proof is 
valid for all possible triangles. Specifically, you may have to address right 
and obtuse triangles as separate cases from acute triangles. 


NING! 
r=} 


a 


WAR 


Fortunately, taking care of the other cases is pretty easy. If ZC = 90°, then we have [ABC] = CEO = hab. 
Since sin 90° = 1, the proposed formula also gives [ABC] = }absin90° = }ab, so the proposed formula gives the 


correct area when ZC = 90°. 


If ZC > 90°, we have the diagramat right, where again our first step is drawing B 
the altitude from B. From ABCP, we have sin ZBCP = #5, so BP = BC sin ZBCP = 
asin ZBCP. Therefore, we have 


| 
| 
l 
B i l 
eoo oe Clee A E anne 
2 2 2 | 
That’s close, but not quite the same as our desired formula, [ABC] = sab sinZACB. A C IP 


However, we have ZBCP = 180° — ZACB, so 
1 
[ABC] = 5% sin ZBCP = ab sin (180° — ZACB). 


Applying the identity sin 8 = sin(180° — @) gives [ABC] = }absin (180° — ZACB) = 4absin ZACB, so the desired 
formula works when ZC is obtuse as well. 


Important: . 


VY 


Ifa= BC, b= AC, and [ABC] is the area of AABC, then 


[ABC] = sab sin C. 


Problem 4.6: What is the largest possible area of a triangle that has two consecutive sides of length 10? Prove 


that no larger area is possible. 


Solution for Problem 4.6: Let the triangle be AXYZ with XY = YZ = 10. We can relate the area of XYZ to the two 
sides with the formula we just proved: 


[XYZ] = 5(XY(YZ) sin ZXYZ = 50 sin ZXYZ. 


The maximum possible value of sin ZX YZ is 1, which occurs when ZXYZ = 90°. Therefore, the maximum possible 
area of AXYZ is 50, which occurs when ZXYZ is a right angle. 0 


Problem 4.7: ABCDisa square and M and N are the midpoints of BC and CD, respectively. What is sin ZMAN? J 


(Source: AHSME) 


Solution for Problem 4.7: We present two solutions. 
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Solution 1: Trig Identities. We start with the diagram at right, in which a is the angle D N C 
whose sine we seek. While we don’t have a right triangle that includes « as an acute angle, 
we can easily relate a to acute angles BAM and ZDAN in right triangles BAM and DAN. 
Because ABAM and ADAN are congruent by SAS Congruence (AB = AD, ZB = ZD, and 


BM = DN), we have ZDAN = BAM. Letting both of these angles have measure 0, we wd 
have 
sina = sin(90° — 20) = cos 20 = 2cos? 0 — 1. 


A B 
(We used the two identities sin(90° — x) = cos x and cos 2x = 2 cos? x — 1 here. We could 
have used either of the other cosine double angle identities, as well.) 


From right triangle MAB, we have cos 0 = 42. Since M is the midpoint of BC and ABCD is a square, we have 
MB = Œ = 4B Therefore, applying the Pythagorean Theorem to ABAM gives 


1 


2 
AM = VAR? + BM? = ap? AB _ ag 


o AB E2 
which means Fie ae and we have 


5 
AB \2 DN 3 
ina =2 29-1=2(47) - 29 ae 
sina cos e372. 1 


Solution 2: Area. We let AB = AD = 2x, so MB = MC = NC =ND =x. (Welt D x N x C 
AB = 2x rather than AB = x to avoid introducing fractions when computing the shorter 
segments.) We draw MN to divide the square into three right triangles and AAMN. The xX 


square has area 4x”, and we have [AMB] = [AND] = x? and [CNM] = - We therefore 


2X M 
have [AMN] = 4x? — x? — x? — x = 3E, We have AN = AM = x V5 from the Pythagorean 
Theorem, so i 
5x? 


1 . 
[AMN] = zAN)AM) sina = -> sina. A Ox B 


Therefore, we must have bx sina = ar so sina = 3. Once again, we have used area to tackle a problem that 


doesn’t appear to be about area. 


Notice that our answer doesn’t depend on x. Because scaling the diagram up or down doesn’t affect ZMAN, 
we could have chosen a specific value for AB rather than letting it be x. So, we could have simply let AB = AD = 2 
and proceeded as above to find sina. 0 


In our solutions to Problem 4.7, we discovered an angle with a sine of 3/5. One of 
the acute angles of a 3-4-5 triangle has sine equal to 3/5. Perhaps we could solve 
the problem by finding a 3-4-5 triangle with ZMAN as one of its acute angles. See 
if you can find it with a little “outside-the-box” thinking. Answer on page 153. 


| Exercises E 


4.1.1 If ZPQR = 90°, ZORP = 42°, and QP = 8, then what are PR and QR to the nearest hundredth? 


Sidenote: 


4.1.2 If XY =4YZ and /XYZ = 90°, then what is ZXZY to the nearest degree? 


4.1.3 Find the measure of an acute angle formed by the x-axis and the graph of 2x + 3y = 7. (Answer to the 
nearest degree.) 
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4.1.4 Find the area of AABC if AB = 4BC = 12 and ZABC = 30°. 


4.1.5 I'm standing at the peak of a mountain that is 14,000 feet above sea level. The angle of depression from this 
peak to a nearby smaller peak is 4°. On my map, these two peaks are represented by points that are 1 inch apart. 
If each inch on my map represents 1.2 miles, and there are 5280 feet in a mile, then how many feet above sea level 
is the second peak? 


4.1.6 The radius of the circle at right is 1. For each of the following trigonometric 
expressions, find a segment in the diagram that has length equal to the trigono- 
metric expression: sin 0, cos 0, sec 0, csc 0, tan 0, cot 0. (Note: You are not asked 
to express each trig function in terms of multiple segments in the diagram; you 
must find a segment whose length equals the corresponding trig function.) 


4.1.7 A, B, and C are vertices of a cube such that AB is an interior diagonal of the 
cube and AC is a diagonal of a face of the cube. Find cos ZABC. 


4.1.8x Square ABCD has center O and AB = 900. Points E and F are on AB with 
AE < BF and E between A and F such that ZEOF = 45° and EF = 400. Find BF. (Source: AIME) 


4.2 Law of Cosines 


Problems l at 


Problem 4.8: In AABC, let AC = 14, BC = 12, and ZC = 34°. In this problem, we find AB to the nearest hundredth. 
(a) Draw altitude BX from B to AC. Find CX and BX to the nearest hundredth. 


(b) Find XA to the nearest hundredth. 
(c) Use parts (a) and (b) to find AB to the nearest hundredth. 


Problem 4.9: Let AABC be an acute triangle with a = BC, b = AC, and c = AB. Use Problem 4.8 as a guide to 
prove that 


E =g +b? —2abcosC. 


Problem 4.10: An airplane leaves an aircraft carrier and flies due south at 400 km/h. The carrier proceeds 60° 
east of north at 32 km/h. If the plane has enough fuel for 5 hours of flying, what is the maximum distance south 
the pilot can travel, so that the fuel remaining will allow a safe return to the carrier? (You may assume Earth is 
flat in this problem.) (Source: CEMC) 


Problem 4.11: In AABC, we have AB = 5, BC = 7, and AC = 8. Find ZBAC. 


Problem 4.12: Equilateral triangle ABC has been creased and folded so that vertex A now 
rests at A’ on BC as shown. If BA’ = 1 and A’C = 2, then what is the length of crease PQ? fae 
(Source: AMC 12) a 


(a) Why do 60° and 120° angles suggest trying the Law of Cosines? Q 
(b) Find PQ. Hints: 228 


Problem 4.13: In quadrilateral ABCD, we have ZA = ZC, AB = CD = 180, and AD # BC. The perimeter of ABCD 
is 640. Find cos A. (Source: AIME) Hints: 175 ; 
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Problem 4.8: In AABC, let AC = 14, BC = 12, and ZC = 34°. Find AB to the nearest hundredth. 


Solution for Problem 4.8: We know how to use trigonometry to find side lengths in right B 
triangles, so we start by drawing an altitude from B to AC as shown. This creates right 
triangle ABCX with the 34° angle as one of its acute angles. From right triangle ACBX, 12 
we have 
= = sin C ~ 0.559, C A 
i 


so BX x 0.559(BC) = 6.71. Similarly, we have CX/BC = cosC ~ 0.829, so CX ~ 9.95. 
This doesn’t tell us AB yet, but we now have the length of one leg of ABXA. If we can find the other, we can use 
the Pythagorean Theorem to find AB. Fortunately, XA is easy to find: XA = AC — CX = 4.05. Now, we can use the 


Pythagorean Theorem to find 
AB = VBX? + XA? = 7.84. 
m 


In Problem 4.8, we were given two side lengths of a triangle and the measure of the angle between these two 
sides. We then found the third side. There was nothing particularly special about the side lengths or the angle. 
We might be able to follow essentially the same process for any triangle. Let’s give it a try. 


Problem 4.9: Let AABC be an acute triangle with a = BC, b = AC, and c = AB. Find a formula for c in terms of 


a, b, and ZC. 


Solution for Problem 4.9: We use Problem 4.8 as a guide. In fact, this problem is B 
essentially the same as Problem 4.8, but with variables a, b, and ZC in place of the 
numbers that were given in the earlier problem. We can use the same steps. We draw 
altitude BX from B to AC. Then, we have sinC = BX/BC, so BX = BCsinC = asinC. 
We also have cosC = CX/BC, so CX = BCcosC = acosC. Therefore, we have 
AX = AC- CX = b-acosC. Next we apply the Pythagorean Theorem to AABX ç A 
to find AB? = BX? + AX’. Substituting our expressions for these three sides gives us 28 


c = a° sin? C + (b — a cos C}? 
= @ sin? C + b* — 2ab cos C + a? cos? C 


= @?(sin? C + cos? C) + b? — 2ab cos C. 


Since sin? C + cos? C = 1, we have 
=a + b —2abcosC. 


Oo 


We can test our new formula with the data from Problem 4.8. 


Once you think you’ve found a new formula, check your work by trying the 
formula on specific examples you have solved without the formula. 


Concept: 


O= 


In Problem 4.8, we have a = 12, b = 14, and ZC = 34°, so we have 
c= @ + b? —2abcosC x 61.44. 


Taking the square root of both sides gives c ~ 7.84, which agrees with our answer from Problem 4.8. 


With a little more casework (which you'll supply as an Exercise), we can show that this equation holds for any 
triangle ABC. 
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Important:  Leta= BC, b= NC and c = AB in AABC. The Law of Cosines states that 


C=aa + i — 2ab cosC. 


Notice that when ZC = 90°, we have cos C = 0, so the Law of Cosines becomes c? = a? + b*, which is just the 
Pythagorean Theorem. 
Problem 4.8 is just a specific example of Problem 4.9, so we call Problem 4.9 a generalization of Problem 4.8. 


Solving a specific example of a general problem can often provide a guide for 
solving the general problem. 


Problem 4.10: An airplane leaves an aircraft carrier and flies due south at 400 km/h. The carrier proceeds 
60° east of north at 32 km/h. If the plane has enough fuel for 5 hours of flying, what is the maximum distance 


south the pilot can travel, so that the fuel remaining will allow a safe return to the carrier? (You may assume 
Earth is fla 


Solution for Problem 4.10: We start with a diagram, including the path of the ship and the path S 
of the plane. The plane leaves the ship at L, flies south to T before turning, and then flies AO 
to S, where it lands on the ship. Because the plane flies south and the ship goes 60° east of L 
north, we have ZTLS = 180° — 60° = 120°. If the plane flies for 5 hours, then the ship moves 

32-5 = 160 km and the plane flies 400 - 5 = 2000 km. Therefore, LS = 160 and LT + TS = 2000. 

We let the desired distance, LT, be x, so TS = 2000 — x, as shown. We have expressions for all 2000 — x 
three sides of ALST, and we know one angle, so we apply the Law of Cosines: x 


TS? = LT? + LF = 2(LT)(LS) cos ZTLS. 
Inserting our expressions and values for lengths and the angle, we have 


(2000 — x)? = x? + 160° — 2(x)(160) cos 120°. 


T 
We have cos 120° = —4 (which is one reason the Law of Cosines is often helpful in problems involving 120° angles), 
and expanding the left side of the equation gives 
2000? — 4000x + x? = x° + 160° + 160x. 


Solving this equation gives x ~ 955, so the plane can fly approximately 955 km south. O 


Problem 4.11: In AABC, we have AB = 5, BC = 7, and AC = 8. Find ZBAC. 


Solution for Problem 4.11: We are given all three side lengths of the triangle, so we have all the information in the 
Law of Cosines except the angle measure. Therefore, we can solve for cos ZBAC with the Law of Cosines. We have 


BC? = AB? + AC? — 2(AB)(AC) cos ZBAC, 


so 
AB? + AC? — BC? _ 25+64-49 1 
2(AB)(AC) 25X8 72 


Since 0° < ZBAC < 180°, cos ZBAC = 3 gives us BAC = 60°. o 


cos ZBAC = 


| Concept: The Law of Cosines can be used to find angle measures as well as side lengths. 
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Problem 4.12: Equilateral triangle ABC has been creased and folded so that vertex A now 
rests at A’ on BC as shown. If BA’ = 1 and A’C = 2, then what is the length of crease PQ? 
| (Source: AMC 12) 


Solution for Problem 4.12: We know that the side length of the original equilateral triangle is BA’ + A’C = 3, but 
we can’t immediately find any other lengths. So, we assign a variable and hope to express other side lengths in 
terms of that variable. We might start by letting z = PQ, but we don’t see any way to express another side length 
in terms of z. Therefore, we start by letting x = BP, which gives us AP = AB— BP = 3— x. But now we seem stuck. 


| Concept: | When stuck ona problem, focus on information you haven’t used yet. | 


We haven't used the fact that APA’Q is the reflection of APAQ over PQ. This tells us that PA’ = PA, so 
PA’ =3-x. Now, we have expressions for the lengths of all three sides of APA’B. We also know that PBA’ = 60°, 
so we apply the Law of Cosines: 


(PA’)? = BP? + (BA’)? — 2(BP)(BA’) cos ZPBA’, 


10) 
(3 — x}? =x? + 1 — 2x cos 60°. 


Since cos 60° = 4, we have (3 — x)? = x? — x + 1. Expanding the left side gives 9 — 6x + x? = x? — x + 1, from which 
we find x = 2, Therefore, we have BP = 3 and PA’ = PA=3-BP= = 


We can do the same thing with AC! 


Concept: If a tactic gives some information in a problem, but doesn’t completely solve it, 
O= then try applying that tactic to the problem again in another way. 


We let QC = y, so A’Q = AQ = 3 — y. Applying the Law of Cosines to AA’QC gives 
(A’Q)? = (A’C + QC? — 2(A’C)(QC) cos ZA’CQ, 


so 
(3 — y}? = 4 + y? — 2(2)(y) cos 60°. 


Solving this equation for y gives us y = Ẹ, so QC = f and A'Q =AQ=3-QC= į. 
One more time! We have AP = 2, AQ = Z, and ZPAQ = 60°, so we apply the Law of Cosines to find 


2 = AP? + AQ? — 2(AP)(A ZPA n ae (=) (5) 
49 49 49  49-16+49-25-49-20 _ 49-21 


Tis eaa 400 = -500 ` 


Taking the square root gives PQ = 7 vV21/20. a 


Each of the previous three problems featured a 60° or 120° angle. Usually, when we see 60° or 120°, our first 
strategy is to look for equilateral triangles or 30-60-90 triangles. But since cos 60° = 5 and cos 120° = —$, the Law 
of Cosines has a particularly simple form when the angle is 60° or 120°. 
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‘Concept: If a geometry problem contains a 60° or 120° angle, you may be able to tackle it 


O= with the Law of Cosines. 


: : s Gor e 


 WARNING!! The Law of Cosines can lead to some pretty heavy algebra, so usually we 
i re look for equilateral triangles or 30-60-90 triangles before pulling out the Law 


of Cosines on a problem that involves 60° or 120° angles. . | 


Problem 4.13: In quadrilateral . ABCD, we have ZA = ZC, AB = CD = 180, and AD + BC. The perimeter of J 


ABCD is 640. Find cos A. oer onn 


Solution for Problem 4.13: That the question asks for the cosine of an angle suggests that D 

we either build right triangles or we apply the Law of Cosines. But we don’t have any a \ 

triangles. So, we make some. The obvious candidate for building triangles is to draw 

diagonals. We don’t want to draw diagonal AC, since that will break up two angles C 
that we know are equal, and we’ll probably have to use that angle equality. So, we try A B 


drawing BD first, thereby creating two triangles that share a side and have equal angles. We apply the Law of 
Cosines to each, using ZA in AABD and ZC in ACBD: 


BD? = AB? + AD? — 2(AB)(AD) cos A, 
BD? = CB? + CD? — 2(CB)X(CD) cos C. 
Setting our two expressions for BD? equal, and noting that cos C = cos A because ZA = CC, gives us 
AB? + AD? — 2(AB)(AD) cos A = CB? + CD? — 2(CB)(CD) cos A. 
We also have AB = CD = 180, so AB? = CD*, and the equation is now 
D? — 2(180)(AD) cos A = CB? — 2(180)(CB) cos A. 
To solve for cos A, we group the cosA terms on the left, and we have 
2(180)(CB) cos A — 2(180)(AD) cos A = CB? — AD’, 
so 
2(180)(CB — AD)’ 


(This step only avoids dividing by 0 because we are told that AD # BC. This gives us some confidence that we’re 
on the right track.) Factoring the numerator of the right side as a difference of squares, we have 


cosA = 


(CB - AD)\(CB+AD) _ CB+AD 
2(180(CB—AD) 2(180) ` 


cosA = 


Now, we just have to find CB + AD. We go back to the problem, and look for information we haven't used yet. 
We haven't used the perimeter. That involves the sum of sides! Let’s try it. We have AB + CD + CB + AD = 640 
and AB = CD = 180, so CB + AD = 640 — 180 — 180 = 280, and we have 


cosA = CE TAD _ 280 _ 280 _7 
= 2(180) ~ 2(180) 360 9° 
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ee pa S T S E R A o E a 


|| __Exercises EEA 


4.2.1 Complete our proof of the Law of Cosines by proving it for obtuse and right triangles. 


4.2.2 Two airplanes take off from the same airport at the same time. One flies due west at 200 miles per hour. 
The other flies 40° north of due east at 250 miles per hour. To the nearest mile, how many miles apart are the 
planes 90 minutes after takeoff? 


4.2.3 Prove each of the following: 


(a) If AB? = BC? + CA?, then ZACB is right. 
(b) If AB* > BC? + CA?, then ZACB is obtuse. 
(c) If AB? < BC? + CA?, then ZACB is acute. 


In each case, is the converse of the given statement true? Why or why not? (The converse of a statement of 
the form “If X, then Y” is “If Y, then X.”) 


4.2.4 One side of a triangle has length twice that of another side, and the third side has length 6. If one angle of 
the triangle is 120°, then what are the possible values of the lengths of the sides of the triangle? 


4.2.5 If XY =3, YZ =5, and ZX = 7, then what is ZXYZ? 


4.2.6 Quadrilateral ABCD is inscribed in a circle. If AB = 2, BC = 3, CD = 4, and DA = 6, then what is AC? 
(Source: CEMC) 


B with radius 3 is internally tangent to circle A at one vertex of J. Circles C and 
D, both with radius 2, are internally tangent to circle A at the other two vertices 
of J. Circles B, C, and D are all externally tangent to circle E. Find the radius of 
circle E. (Source: AIME) 


4.2.7x Equilateral triangle T is inscribed in circle A, which has radius 10. Circle A D 


4.2.8x Triangle ABC has a right angle at B and contains a point P for which 
PA = 10, PB = 6,and ZAPB = ZBPC = ZCPA. Find PC. (Source: AIME) Hints: 205 (ee 
N 
n 


A 


See 


4.3 Law of Sines 


We start with an important fact from elementary geometry that will be helpful in this section. A B 


In the diagram at right, we say that ZBAC is inscribed in arc BC of the circle because A, B, and 
C are all on the circle. The measure of an inscribed angle equals half the measure of the arc 


it intercepts, so ZA = 2E. If you are not familiar with this relationship, try proving it yourself 
before looking it up in a geometry text or online. (This relationship, and many others involving 
angles and circles, is covered in Art of Problem Solving’s Introduction to Geometry.) 


|| __Probiems ig 


Problem 4.14: In APQR, we have PR = 12, ZQPR = 66°, and ZPRQ = 63°. In this problem, we find PQ and QR. 
(a) Let T be the foot of the altitude from P to OR. Find PT to the nearest hundredth. 


(b) Use PT to find PQ to the nearest hundredth. 
(c) Find QR to the nearest hundredth. 


=) 
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Problem 4.15: Suppose that AABC is an acute triangle with a = BC, b = AC, and c = AB. Prove that 


moo b T e 
sinA sinB sinC’ 


Problem 4.16: Let points P and Q be points on a shoreline that are 1.5 miles apart. Let B be the base of a 
radio tower on a distant island, and T be the top of the tower. A surveyor uses an angle measurement tool to 
determine that ZPQB = 58° and ZQPB = 72°. She also measures that the angle of elevation from P to the top of 
the tower is 1°. In this problem, we find the height of the tower in feet. 

(a) Find PB. 


(b) How tall is the tower in feet? 


Problem 4.17: In this problem, we prove the Extended Law of Sines, which states that if a = BC, b = AC, c = AB, 
and R is the radius of circumcircle of AABC, then 


a b 


snA sinB 


(Note: The circumcircle of a triangle is the circle that passes through all three vertices of the triangle.) 
(a) Start with acute triangle AABC. Let D be the point on the circumcircle such that AD is a diameter of the 
circle. What angle of AABC equals ADC, and why must these angles be equal? Hints: 229 


(b) Show that PE = 2R, where R is the circumradius of AABC. 
sin ZAB 


(c) Prove that the Extended Law of Sines holds for right and obtuse triangles. 


Problem 4.18: — 7 
(a) Let AABC and ADEF be triangles such that AB = DE = 7, BC = EF = 5, and 4CAB = ZFDE = 30°. Must 
sin ZACB = sin DFE? 


(b) Must AABC and ADEF be congruent? If not, can we deduce anything about the relationship between 
ZACB and ZDFE? 


Problem 4.19: In triangle ABC, AB = 13, BC = 15, and CA = 14. Point D is on BC with CD = 6. Point E is on BC 
such that CE > CD and ZBAE = ZCAD. Find BE. (Source: AIME) Hints: 241, 179 


Problem 4.14: Suppose PR = 12, ZQPR = 66°, and ZPRQ = 63° in APQR. Find PQ and RQ to the nearest | 
hundredth. 


Solution for Problem 4.14: We start by building a right triangle, since we know Ip 

how to use trigonometry to find lengths of sides in a right triangle. We draw 

altitude PT to side QR. From right triangle APTR, we have PT/PR = sinR, so 

PT = PRsinR 7 10.69. We also have TR/PR = cos R, so TR = PR cos R ~ 5.45. 12 


Now, we can use right triangle APQT to find lengths QT and PQ. First, we 
find that ZQ = 180° — 66° — 63° = 51°. We have PT/PQ = sinQ, so we have 
PQ = PT/(sin Q) ~ 13.76. We also have PT/QT = tan Q, so QT = PT/(tan Q) ~ 8.66. Q I IK 
Finally, we have QR = QT + TR = 14.11. O 


Let’s try using our approach in Problem 4.14 to produce another “law.” 
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Problem 4.15: Suppose that AABC is an acute triangle with a = BC, b = AC, and c = AB. Prove that 


a b c 
sinA sinB sinC’ 


Solution for Problem 4.15: We use Problem 4.14 as a guide. We draw altitude AT to A 

BC. From right triangle AATC, we have sin C = AT/AC, so AT = ACsinC = bsinC. 

From right triangle AABT, we have sin B = AT/AB,so AT = ABsinB = csin B. These 

two expressions for AT must be equal, so we have bsinC = csinB. Dividing this b c 
equation by sin B and by sin C, we have 


b mee 
sinB sinC’ 


We can follow essentially the same steps starting with the altitude from C to AB to show that b/(sin B) = a/(sin A). 
So, we have 


Important: Leta = BC, b = AC, and c = AB in AABC. The Law of Sines states that 


a b c 


sinA sinB sinC’ 


We'll tackle the cases in which AABC is right or obtuse in Problem 4.17. 


Problem 4.16: Let points P and Q be points on a shoreline that are 1.5 miles apart. Let B be the base of a 
radio tower on a distant island, and T be the top of the tower. A surveyor uses an angle measurement tool to 
determine that ZPQB = 58° and ZQPB = 72°. She also measures that the angle of elevation from P to the top 
of the tower is 1°. How tall is the tower in feet? (There are 5280 feet in a mile.) 


Solution for Problem 4.16: We first consider ABPT at right. We want BT, the height of the T 
lighthouse, and the surveyor found that ZBPT = 1°. If we could find any of the lengths of a 
the sides of ABPT, we could use trigonometry to find the remaining sides. Unfortunately, C 
we don’t yet know any of the sides of this triangle, so we turn to what we know about point P B 


Q. 


We are given two angle measures and a side length of APQB, so we can find the third angle, Q 
and then use the Law of Sines to find the other sides. We have ZPBQ = 180° — 58° — 72° = 50°. 
The side length we’re most interested in is PB, since we can use that to find BT in the right 58° 
: : 1.5 
triangle above. Applying the Law of Sines to APQB gives 


PBs PQ > 
sinZPQB sin ZQBP’ P B 


a _ PQsinZPQB _ 15sin58° _ 


= = 1.66. 
sin ZOBP sin 50° 
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Now we can find the height of the tower. In right triangle BPT, we have tan ZBPT = 85. T 
Since PB ~ 1.66 miles and ZBPT = 1°, we have BT = PB tan ZBPT ~ 0.0290 miles. There are | 
5280 feet in one mile, so the height of the tower is 0.0290 - 5280 ~ 153 feet. O a 

í P 


| Problem 4.17: Prove the Extended Law of Sines, which states that if a = BC, b = AC, c = AB, and R is the 
| radius of the circumcircle of AABC, then 


@ á oe 


Sine ZI 


sinA sinB 


(Note: The circumcircle of a triangle is the circle that passes through all three vertices of the triangle.) 


Solution for Problem 4.17: If AABC is acute, then we already know that the first three B 

ratios are equal, so we only HEME to prove that ene» of them equals 2R. To get a 

clue where to start, we rewrite =< = 2R as sinC = 3g. This suggests that we seek 

a right triangle with c as one ee E and 2R as ie length of the hypotenuse. 

Therefore, we locate point D on the circle such that AD is a diameter (which has D A 
length 2R), and we draw AD and BD. Since ZABD is inscribed in a semicircle, we 

have ZABD = 90°, so AABD isa eas triangle with leg AB = c and hypotenuse 

AD = 2R. Then, we have sin ZADB = = zk, which means we only have to show € 

that sin ZADB = sin ZC to finish. sie ‘ADB and ZC are inscribed in the cls arc 

of the circle, these two angles are equal, which means sin C = sin ZADB = 3g, as desired. 


Unfortunately, we're not quite finished. Our proof above assumes that ZC is acute. We have to tackle the cases 
of right angles and obtuse angles separately. 


The proof is pretty straightforward for right angles; we can even use the diagram above. ADB is a right triangle 
with ZABD = oe and the circumradius of the triangle is R = AD/2, so AD = Es Since sin ABD = sin90° = 1, 
we also have =-42. = AD. Setting our two expressions for AD equal gives =-42., = 2R. 


If ZC is obtuse, our first step is still to build right triangle ADB, as shown at right. B 
As before, we have sin ZADB = AB = 57 however, we clearly don’t have ZADB = ZC 
in this case. But we don’t need the angles to be equal! We only need their sines to be 
equal to be able to conclude that sin C = 5. Supplementary angles (angles that sum to D A 
180°) have equal sines, so we wonder if ZC and ZADB are supplementary. The measure 
of an angle inscribed in an arc of a circle is half the measure of the arc, so we have 


ZADB + /C = ACB A ADB M ACB + ADB 
2 2 2 

Putting the arcs ACB and ADB together gives us the whole circle, so the sum of their measures is 360°. Therefore, 

we have ZADB + ZC = 180°, so ZADB = 180° — ZC, and sin C = sin(180° — ZC) = sin ZADB = 55, as desired. O 


Important: Let a = BC, b= ‘AC, and c = AB in AABC, and let R be the radius of the 
circumcircle of AABC (that is, the radius of the circle that passes through all 
three vertices of AABC). The Extended Law of Sines tells us that 


a b c 


snA sinB sinc ~~ 
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Problem 4.18: aj 
(a) Let AABC and ADEF be triangles such that AB = DE = 7, BC = EF = 5, and ZCAB = ZFDE = 30°. Must 


sin ZACB = sin DFE? 


(b) Must AABC and ADEF be congruent? If not, can we deduce anything about the relationship between 
ZACB and £DFE? 


Solution for Problem 4.18: 
(a) Applying the Law of Sines to AABC gives 


AB oC a Oman aa 
sinZACB  sinZCAB sin30°. ` 
Therefore, we have 
ABR 7 
in ACB = — = —. 
ale foetal E A 
Similarly, applying the Law of Sines to ADEF gives 
DS ES ii 
sin /DFE sin/FDE  sin30e ” 
sis DEM 
in ZDFE = —— = —. 
ee 10 10 


Therefore, we must have sin ZACB = sin ZDFE. 


(b) If AABC = ADEF, then we must have ZACB = ZDFE. At this point, all we know is that the sine of each of 
these angles equals 5 However, there are two angles between 0° and 180° whose sine equals 4, one acute 
angle and one obtuse angle. We can find these angles with a calculator. The acute angle is approximately 
44.4°, and the obtuse angle is 180° — 44.4° = 135.6°, since sinx = sin(180° — x). We aren’t given any 
information in the problem that prevents ZACB or ZDFE from taking on either of these measures. 


We might have ZACB = 44.4° while ZDFE = 135.6. Then, we have C 
ZABC = 180° —44.4° -30° = 105.6°, while ZDEF = 180° —135.6° -30° = 
14.4°. In this case, the triangles are clearly not congruent. (The two F 
triangles are drawn to scale at right.) All we can say is that if the 3 
triangles are not congruent, then ZACB + ZDFE = 180°, since the sines A B D E 


of these two angles are equal but the angles are not equal. 


Sidenote: You probably learned various congruence theorems while studying geometry, such 
as Side-Angle-Side Congruence and Side-Side-Side Congruence. Problem 4.18 
explains why there is no Side-Side-Angle Congruence Theorem. That is, if two 
sides and an angle other than the angle between these sides in one triangle are 
congruent to the corresponding sides and angle of another triangle, we cannot 
always conclude that the two triangles are congruent. (You'll see a special case in| 
the Exercises in which Side-Side-Angle is enough to establish the congruence of 
two triangles, but usually it is not enough.) 


Problem 4.19: In triangle ABC, AB = 13, BC = 15, and CA = 14. Point D is on BC with CD = 6. Point E is on 


BC such that CE > CD and ZBAE = ZCAD. Find BE. (Source: AIME) 
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Solution for Problem 4.19: We let 6 be the measure of equal angles ZBAE and CAD. We 
need to use this angle equality, but we don’t see any likely candidates for congruent 
triangles. We do have some equal side lengths, so perhaps the Law of Sines will 
allow us to use the angle equality somehow. Applying the Law of Sines to AACD 
and AABE gives: 


BE _ AE AB CD AD AC 


FG Shih a RID Ce Gh ne 


(4.1) 


na = aT: by $ mao = F = oe 
BE _ AE  sinC 
CD AD sinB’ 

We know CD, but we don’t know any of the lengths on the right side. While we don’t know the sines on the right, 


ere ratio suggests elas the Law of Sines to AABC, since we know the side lengths of AABC. This gives us 
= AC so ŠE = 42 and we know AB and AC. So, we now have 


anc = sinB” sinB =e, 


cancels out sin 8 and leaves 


Dividing 4 


AE sinC AE AB 
AD aa | (Meee eS) 


We know three of the lengths on the right side, but not AE or AD. Let’s take a look at the third ratio in each of the 


E : 3 Í a 19 BE _ __AB SOE SING i 
original Law of Sines equations we found on line (4.1). Dividing -5 = =4a¢m by sag = axzapc Bives 


BE = 


BE 7 AB sin ZADC 
CD AC sinZAEB’ 


Solving for BE gives us 


AB sinZADC 
AC sin ZAEB — 
sin ZADC 


This looks familiar. The only difference between this and our final result on line (4.2) is that here we have £7455 
in place of 45. The Law of Sines together with the identity sinx = sin(180° — x) explains why these are equal. 
Applying the Law of Sines to AAED gives +455. = =-49.., so we have 


BES 


AE _ sinZADE _ sin(180°-ZADC) _ sin ADC 
AD sinZAED  sin(180° — ZAEB) sin ZAEB’ 


This brings us right back to Equation (4.2). 


Concept: Most complex problems can be solved in more than one way. So, don’t get trapped 
Q= _ into thinking that you need to find the way to solve the problem. | 


We still have to find 45 (or the corresponding ratio of sines). So far, we've made progress by focusing on each 
of the three little triangles, and on the large triangle AABC. The only triangles we haven't used yet are AADB and 
AAEC. Let's try the Law of Sines on these. 


Concept: When stuck on a problem, experiment with parts of the problem you haven't 


O= used. 
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The Law of Sines applied to AAEC and AADB gives us 


A 
AED EC meee Te IND BD AB AN 
sinC sin/EAC  sinZAEC sinB sin/BAD  sinZADB` 


We start with the first two ratios in these equalities, because ZEAC = ZBAD (since i 1 
both equal 0 + ZDAE). Dividing 2%. = fy by 43 = — 5) gives 

AE sinB EC _ sinZBAD 

AD smC 7 BD sin ZEAC wee 
We saw earlier that $£Ẹ = 48, and ZBAD = ZEAC means the sines on the right cancel. ._,_}————___——__——_} 
So, now we have 15 


Success! We are given AB and AC, and we have BD = BC — CD = 9 and EC = BC - BE = 15 — BE. So, going 
back to our earlier expression for BE in Equation (4.2), we have 


_ AE AB EC GAB 15-BE (13\? 
Sag m yc te i 


2 
Solving BE = 15E . (13). 6 gives BE = 2535/463. D 


Exercises p 


4.3.1 My house is on one side of a river and my neighbor’s house is on the other. I’d like to figure out how far 
away his house is from mine, but I don’t want to get wet, and we don’t have a boat or a bridge. There’s a well on 
my property that is 150 feet from my house. If I stand at my house and face the well, I must then turn 42° to my 
left to be facing my neighbor’s house. If I stand at the well and face my house, I must turn 123° to my right to face 
my neighbor’s house. To the nearest foot, what is the distance between my house and my neighbor’s house? 


4.3.2 The Leaning Tower of Alpine has been constructed 5° from perfectly vertical. When it casts a 20 foot 
shadow in the direction directly opposite the direction it leans, the angle of elevation from the tip of the shadow 
to the sun is 72°. How far is the top of the tower from the ground (to the nearest foot)? 


4.3.3 Prove that if AB > AC > BC in AABC, then ZC > ZB > ZA. Is the converse true? 


4.3.4 The perimeter of a triangle is numerically 12 times the average of the sines of the angles of the triangle. If 
one side of the triangle has length 2, what are the possible measures of the angle opposite that side? 


4.3.5 In Problem 4.18, we encountered AABC and ADEF such that AB = DE = 7, BC = EF = 5, and ZCAB = 
ZFDE = 30°, and showed that these two triangles need not be congruent. When I first created this problem 
to demonstrate why there is no “Side-Side-Angle Congruence Theorem,” I had the 5 and 7 reversed, so that 
AB = DE = 5, BC = EF = 7, and ZCAB = ZFDE = 30°. Why did I have to change it? 


4.3.6 What’s wrong with this “proof” of the Angle-Angle Similarity Theorem: 


Suppose we have AABC and ADEF with ZA = ZD and ZB = ZE. The Law of Sines gives us 


cae. Y: EF _ DE 
sinA sinB sinD  sinE 
Therefore, we have BC/AC = (sin A)/(sin B) and EF/DF = (sin D)/(sin E). Since ZA = ZD and ZB = ZE, 
we have (sin A)/(sin B) = (sin D)/(sin E). So, we have 
BC  sinA _ sinD _ EF 
AC sinB sinE DF 
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Because ZA = ZD and ZB = ZE, we have ZC = ZF. We can follow essentially the same steps as above to 
deduce that BE : AC : AB = EF : DF : DE. 


4.3.7x Distinct points A and B are on a semicircle with diameter MN and center C. The point P is on CN and 
ZCAP = ZCBP = 10°. If MA = 40°, then what is BN? (Source: AHSME) 


4.4 More Triangle Relationships 


In addition to the Law of Sines and the Law of Cosines, there are many other triangle relationships that involve 
trigonometric functions of the angles of a triangle, or that can be proved using trig. In this section, we discover 
some of these relationships. More importantly, we learn how to develop them—there are far too many of these 
relationships to memorize them all. Instead, we learn how to derive them, so we can do so whenever we need 
them. 


Before we get started, we’ll review some notation we typically use with triangles, definitions of some of the 
key parts of a triangle, and a list of some of the basic relationships we will use. 


We let the side lengths of AABC be a = BC, b = AC, and c = AB. We denote the area of AABC as [ABC]. The 
semiperimeter of a triangle is half the triangle’s perimeter, and we usually use s to denote the semiperimeter. The 
circle that passes through all three vertices of AABC is called the circumcircle of AABC and its radius is called the 
circumradius of the triangle. The circle that is tangent to all three sides of a triangle is the incircle of the triangle; 
its radius is the inradius of the triangle. We usually use R for the circumradius of a triangle and r for the inradius 
of the triangle. 


Two formulas that we proved in Art of Problem Solving’s Introduction to Geometry that C 
will be useful in this section are: r 
= D 
e Let F be the point where the incircle of AABC is tangent to side AB. Then, we 
have AF = s — a and BF = s — b. 
e [ABC] = rs. A E B 


You'll have a chance to prove these as Exercises. 


Problems > 
l abc a 
Problem 4.20: Show that [ABC] = IR Hints: 194 


Problem 4.21: Show that b = c cos A + a cos C in acute triangle ABC. Hints: 280 


A P A s(s — a) 
2 


Problem 4.22: Show that tan — = er? and cos —- = 4/ ————. Hints: 176 


2 bc 


Problem 4.23: Show that sin A + sin B + sinC = % in any triangle ABC. 


Problem 4.24: Show that in any non-right triangle ABC, we have tan A + tanB + tanC = tan Atan B tan C. 


Hints: 203 
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Problem 4.25: In this problem, we prove Stewart's Theorem. Let point D be on side BC, and let d = AD, m = BD, 
and n = CD. Stewart’s Theorem states that 


a(d + mn) = mb? + nc’. 


(a) Which of the triangle relationships involving lengths is most likely to produce expressions like those in 


Stewart’s Theorem? 


(b) Apply your answer to part (a) to triangles ABD and ACD. Which angles of these two triangles will be 
most convenient to focus on, and why? 


(c) Eliminate the trig functions from your results in part (b) and perform some clever algebra to prove that 
ald? + mn) = mb? + ne. 


‘Problem 4.20: Show that [ABC) = = | 


Solution for Problem 4.20: Rather than start from scratch with a diagram, we try using formulas we already know. 


Concept: Many triangle formulas can be proved by combining formulas you already know. 


We look at the key features of “. First, we have a product of side lengths in the numerator. We already have 
an area formula with a product of side lengths: 


[ABC] = ab sin C. 


We also already have a oe involving R, the Extended Law of Sines, which tells us that 2R = ṣf¢. Solving 
this for sin C gives sin C = 52, and substituting this into [ABC] = 5ab sin C gives [ABC] = abe m 


Problem 4.21: Show that b = c cos A + a cos C in acute triangle ABC. 


Solution for Problem 4.21: We present two solutions: 


Solution 1: Geometry. The two cosines suggest that we look for right triangles, one with B 


ZA as one of the acute angles and one with Drawing altitude BX accomplishes both. 


From right triangle BXA, we have cos A = 4% = 4*, so AX = ccos A. Similarly, we have 


CX = acos C. Therefore, we have 
b = AC = AX + CX = ccos A + a cos C. 
A Xa G 
Solution 2: Formula Bashing. We already have a formula that relates the cosine of an angle of a triangle to the . 
side lengths of the triangle: the Law of Cosines. Solving for the cosine in the Law of Cosines gives 
b? +e a? 
2bc ` 


Doing the same for cos C and substituting into c cos A + a cos C gives 


cos Á = 


” C +c -a | a +b- b4+ce-at4+a?+h—c2 A 
+acosC e ———__ +4: a a = b. 
ee 2be 2ab 2b 


a A —es__ TT DT 
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(Note that our second solution shows that this relationship holds for right and obtuse triangles, too.) O 


Problem 4.22: Show that tan 2 7 e and cos 2 = 4/ = 


2 s-a 2 


Solution for Problem 4.22: We'll start with the formula for tan $. Angle bisectors produce € 

angles that are half the angles of the triangle, and the angle bisectors intersect at the center 

of the incircle. Because the incircle is tangent to all three sides of the triangle, we form a 

right triangle by connecting the incenter of the triangle to a vertex and to a neighboring 

point of tangency. The first desired formula jumps right out at us now. In right triangle 

IAF, we have AF = s —a, IF = r, and ZIAF = 4, so tan $ = =. $ 
A 

Unfortunately, triangle IAF isn’t going to give us cos 4 quickly, since we don’t have a 3 

nice expression for IA. However, we can express cos 4 in terms of cos A, and we can write cos A in terms of the 

side lengths of AABC using the Law of Cosines. This plan might look daunting, but there are a couple reasons it 

also looks promising. First, the cosine half-angle formula introduces a square root, which we need. Second, the 

numerator of the desired expression has squared side lengths when we expand s(s — a), and the Law of Cosines 

had squared side lengths. Applying the half-angle formula and then the Law of Cosines gives: 


A 1+cosA 1+ e 
cos = = 4/ ——— = {/ ———*— 
2 2 2 
o Aee Cad OTO ae 


Abc 


“Vato 
_ ,/@rerab+c—a) _ | /(2s)(2s— 2a) _ zZ 
j Abc =E RE a 


aan e show that sineesing eeincG= — 


R 


in any triangle ABC. 


Solution for Problem 4.23: Sines, side lengths, and the circumradius: this looks like a job for the Extended Law of 


Sines. Since 
a b c 


mA a Sin 


we have 7 5 
7 n y a E a+o+c S 
sinA +sinB +sinC = 52 + 55 + 5p = OR R 


(m) 


Problem 4.24: Show that in any non-right triangle ABC, we have tan A + tanB + tanC = tan A tan B tanC. 


Solution for Problem 4.24: We start by asking ourselves why it’s important that the three angles be angles of a 
non-right triangle. 


Concept: Focusing on the key restrictions ina problem often leads to the solution. | 


In a right triangle, the tangent of one of the angles is undefined, so that explains why the triangle must be 
non-right. But what’s important about A, B, and C being the angles of a triangle? The only thing we know from 
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the fact that the three angles are the angles of a triangle is A + B + C = 180°. This at least allows us to eliminate 
one of the variables in the identity. We have C = 180° — (A + B), so tanC = tan(180° — (A + B)). Applying the 
identity tan(180° — x) = — tan x gives tan C = tan(180° — (A + B)) = — tan(A + B), and now we have a clear path to 


the solution: 
tanA + tanB 
tanC = — tan(A + B) = ---_—___. 
ea au ) 1 — tan A tan B 
Note that we cannot have 1 — tan A tan B = 0 because this only occurs if tan A = <1, = cot B. This only holds in a 


triangle if A + B = 90°. However, we are told that the triangle is not a right triangle, so we cannot have A +B = 90°. 


(4.3) 


Multiplying both ends of Equation (4.3) by 1 — tan A tan B and rearranging gives the desired 


tanA + tan B + tan C = tan A tan B tan C. 


| Problem 4.25: Let point D be on side BC, and let d = AD, m = BD, and n = CD. Stewart's Theorem states that 


a(d? + mn) = mb? + nc’. 


Prove Stewart's Theorem. 


Solution for Problem 4.25: The desired equation has multiple squares of side lengths, just A 

like the Law of Cosines. We’ll likely have to apply the Law of Cosines to both AABD and 

AACD, since we need equations involving m, n, and d. We focus on ZADC and ZADB, b c 
because these angles are supplementary, which allows us to easily relate their cosines: 

cos ZADC = cos(180° — ZADB) = —cos ZADB. From the Law of Cosines, we have 


C D B 
gP +n* = fo 
ZADC = —————_ 

cos ZADC dn 7 a 

d +m- e? 
cos ZADB = o me 

Therefore, cos ZADC = — cos ZADB gives us 
P+re-P +m -c 
2dn 7 2dm 


Multiplying both sides by 2dmn gives 
dm + mn? — bm = -(d2n + mn — cn). 
Rearranging this equation gives dm+ d’n+mn?+m?n = b’m+c*n. Factoring gives d*(m+n)+mn(m+n) = b’m+cn, 


and because m + n = a, we have the desired a(d? + mn) = mb? + ne. o 


Sidenote: For some people, Stewart’s Theorem is easier to remember as 


< dad + man = bmb + cnc. 


Most of the relationships we discovered in this section are not that important, which is why they are not in 
Important boxes. However, the two key strategies we used to find them are important: 
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Concepts: = Two strategies for proving triangle identities are: 


e Use trigonometric and geometric identities that you have already proved. 
Use parts of the desired identity as clues for which known identities you 
should try to use. 


e Use geometric tools in combination with trigonometry, often by building. 


Exercises 


4.4.1 Let F be the point where the incircle of AABC is tangent to side AB. Show that AF = s — a and BF = s — b. 
4.4.2 Show that [ABC] = rs. 

4.4.3 Complete our geometric proof that b = c cos A + a cos C by addressing the cases in which AABC is right or 
obtuse. 


[ABC] 


4.4.4 Show that in AABC, we have sin A sin B sin C = SRZ ` 


4.4.5 


A 2 |(s— = 
(a) Use the identity we found for cos 7 to show that sin . = -e 


(b) Prove that [ABC] = -s(s — a)(s — b)(s = c). (This is known as Heron’s formula. We present a dazzling proof 
of Heron’s formula with complex numbers on page 484.) 


abc 
Va+b+oa—b+oatb—c\b+c—a) 


4.4.6 Let M be the midpoint of BC. Find the length of median AM in terms of a, b, and c. 


(c) Show that R = 


4.4.7x Let AABC be an acute triangle whose altitudes meet at point H. Show that AH = 2R cos A. Hints: 279 


an 428 =h 
4.4.8x Show that in AABC, if a = BC and b = AC, then we have — 5 = = This is sometimes called the 
ame op b 


Law of Tangents. Hints: 30 


4.5 Summary 


If triangle XYZ is a right triangle with ¿XYZ = 90° and X 


l Important: 
; ZX = 0, then 


; YZ ERO -YZ 
sind = Xz’ cos 0 = Xz’ tan 0 = XY" 


Another way of viewing this is: If @ is the measure of y 
an acute angle in a right triangle, then we have 


dig SPROUTS pe Sent Eas oe Re 
Meise hypotenuse adjacent leg 


pa ls tn a a er 
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Important: Ifa = BC, b = AC, and [ABC] is the area of AABC, then | 
1 | 
[ABC] = SabsinC. | 


Important: Leta = BC, b = AC, andc = AB in AABC. The Law of Cosines states that 


2 


= a? + b? — 2ab cos C. 


‘Important: Leta = BC, b = AC, and c = AB in AABC, and let R be the radius of the] 
W circumcircle of AABC (the radius of the circle that passes through all three 
vertices of AABC). The Extended Law of Sines tells us that 


a b c 


SA 7 SnD ae 


Things To Watch Out For! 


When proving a statement for all possible triangles, make sure your proof is valid for all possible triangles. 
Specifically, you may have to address right and obtuse triangles as separate cases from acute triangles. 


Problem Solving Strategies 


Concepts: e Many geometry problems can be solved by assigning variables to lengths 
O= or angles in the problems, and then finding other lengths or angles in terms 
of the variables. The goal is usually to find a way to build an equation that 
you can solve. 


Area can be a surprisingly powerful problem solving tool, even in problems | 
that don’t appear to be about area. 


the formula on specific examples you have solved without the formula. 


Solving a specific example of a general problem can often provide a guide 
| for solving the general problem. 


When stuck on a problem, focus on information you haven't used yet. 


If a tactic gives some information in a problem, but doesn’t completely solve | 
| it, then try applying that tactic to the problem again in another way. 


Most complex problems can be solved in more than one way. So, don’t get 


Once you think you’ve found a new formula, check your work by trying 
| trapped into thinking that you need to find the way to solve the problem. | 
| 


| e Focusing on the key restrictions in a problem often leads to the solution. | 


Extra! On page 135, we asked you to find a 3-4-5 right triangle in a problem. The 
w 3-4-5 triangle is in bold at right. The lower left square in the grid corresponds 
to the square in the original problem. 
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REVIEW PROBLEMS 


4.26 In right triangle STU, we have ZS = 3/T and ZU = 90°. Find ST/TU to the nearest hundredth. 


4.27 Find all possible values of ZPQR if PQ = 12, QR = 4 V3, and the area of APQR is 12 V6. 
4.28 If JK =12,KL=7, and ZJLK = 90°, then what is ZJKL to the nearest degree? 
4.29 Let the graphs of 2x — 3y = 7 and 3x — y = 9 be k and £, respectively. 


(a) To the nearest degree, find the acute angle between k and the x-axis. 
(b) To the nearest degree, find the acute angle between £ and the x-axis. 
(c) To the nearest degree, find the acute angle between k and £. 


(d)x Find an expression for the tangent of an angle between two lines with slopes mı and mz, where mı + mp. 
Test your result with k and £. 


4.30 Two strips of width 1 overlap at an angle of a as shown. What is the area of the overlap 
in terms of a? (Source: AHSME) 


4.31 A boat sails 9 miles per hour along the latitude 30° N. Assuming that the Earth is a sphere 
with radius 3950 miles, how long will it take the boat to sail from longitude 19° W to 46° W (to 
the nearest hour)? i 


4.32, M is the midpoint of side AB of AABC. If ZABC = 90°, and sin ZMCB = 0.8, then what is cos ZACB? 


4.33 Triangle ABC has a right angle at C, AC = 3 and BC = 4. Triangle ABD has a right angle at D 
A and AD = 12. Points C and D are on opposite sides of AB. The line through D parallel to AC 
meets CB extended at E. Find DE/DB. (Source: AHSME) 


4.34 In AGHI, we have GH = HI = 9 and GI = 6. 
(a) Find tan ZGIH. A 


(b) Find tan ZGHI. C B E 
(c) Find cos ZGHI. 


4.35 When I look due west, I look straight at a mountain peak that I know is 4.6 miles away. If I turn northward 
37°, I will be looking directly at a mountain peak that is 5.8 miles away. To the nearest tenth of a mile, how far 
apart are the mountain peaks? 


4.36 What’s wrong with the following proof of the Pythagorean Theorem: 


In the text, we proved that if a = BC, b = AC, and c = AB, then c* = a* + b? — 2ab cos C. If ZC = 90°, then 
we have cos C = 0. Therefore, if ZC = 90°, we have c? = a? + b?. 


4.37 Find the angles of a triangle with side lengths 2, 2, and V6 — V2. 


4.38 In AABC, AB =5, BC =7, AC = 9, and D is on AC with BD = 5. Find AD/DC. (Source: AHSME) 


4.39 Is it possible for the three side lengths of a triangle to form an arithmetic sequence with common difference 
1, and have one angle with measure 60°? Why or why not? 


154 


CHALLENGE PROBLEMS 


SSS ea oe Sars sss Sp ss ES i i SSS 


4.40 We define the dihedral angles between two intersecting planes as follows. Suppose line k is the intersection 
of the planes, and point A is on k. Let point B be in one plane and C in the other such that both AB and AC are 
perpendicular to k. Then, BAC and its supplement are the dihedral angles between the planes. (There are two 
dihedral angles between two intersecting planes, just as there are two angles between two intersecting lines.) 


A regular tetrahedron is a triangular pyramid in which all four faces are equilateral triangles. Find the acute 
dihedral angle between two faces of a regular tetrahedron to the nearest tenth of a degree. 


4.41 In APQR, we have PQ = 4, QR = 6, and ZP = 22R. Find cos ZR. 
4.42 Does there exist a triangle ABC such that AB = 3, BC = 8, ZA = 73°, and ZC = 57°? Why or why not? 


4.43 A plane takes off, flies in straight line for 350 miles, then turns 20° and flies straight for another 2 hours 
before landing at an airport that is 800 miles from where the plane took off. How long did the whole flight take if 
the plane flew at a constant speed throughout the flight? 


4.44 The base of an isosceles triangle is 6 inches and one of the equal sides is 12 inches. What is the radius of the 
circle through the vertices of the triangle? (Source: AHSME) 


4.45 Show that in AABC, if AB cos B = AC cosC, then ZB = ZC. 


4.46 We showed in the text that there is no such thing as the “Side-Side-Angle Congruence Theorem.” However, 
suppose AB = DE = 6, AC = DF = 12, and ZBCA = ZEFD = 30°. Use the Law of Sines to explain why we can 
conclude that AABC = ADFF in this special case. What makes this case special? 


4.47 The Angle Bisector Theorem states that if D is on side BC of AABC such that BAD = CAD, then 
AB/BD = AC/CD. Prove the Angle Bisector Theorem. 


i] Challenge Problems Bies 


4.48 Inthe diagram at right, AD is an altitude of AABC with length 1, and angles A 
B and C are acute. Use the diagram at right to prove the angle sum identity for 
sine, 


sin(a + f) = sin a cos f + sin f cosa. 
C] 
Hints: 278 B D @ 
4.49 Show that in AABC, we have 4cos £ cos 8 cos § = $. 


4.50 Let AABC be an acute triangle whose altitudes meet at point H, and let X be on BC such that AX 1 BC. 
Show that HX = 2R cos B cos C. Hints: 63 


4.51 In triangle ABC, D is on AC and F is on BC. Also, AB L AC, AF L BC, and BD = DC = FC = 1. Find AC. 
(Source: AHSME) Hints: 49 


4.52 In quadrilateral ABCD, BC = 8, CD = 12, AD = 10, and ZA = ZB = 60°. Find AB. (Source: AIME) 
Hints: 129, 101 


4.53 A right circular cone has a base with radius 600 and height 200 V7. Anant starts at a point on the surface of 
the cone whose distance from the vertex of the cone is 125, and crawls along the surface of the cone to a point on 
the exact opposite side of the cone whose distance from the vertex is 375 V2. Find the least distance that the ant 
could have crawled. (Source: AIME) Hints: 259 
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4.54 In AABC, the angles A, B, and C satisfy the equation cos A cos B+sin Asin B sin C = 1. Determine all possible 
values of ZC. (Source: CEMC) Hints: 65 


4.55 Consider a pyramid PABCD whose base ABCD is square and whose vertex P is equidistant from A, B, C, and 
D. Show that if AB = 1 and ZAPB = 20, then the volume of the pyramid is Vcos20/(6sin 0). (Source: AHSME) 


4.56 Points A, B, and C ona circle of radius r are situated so that AB = AC, AB > r, and the length of minor arc 
BC is r. Show that AB/BC = £ csc }. (Source: AHSME) Hints: 266 


4.57 The lengths of the sides of a triangle are consecutive integers, and the largest angle is twice the smallest 
angle. What is the cosine of the smallest angle? (Source: AHSME) Hints: 248 


4.58 Point D is on side CB of triangle ABC. If ZCAD = ZDAB = 60°, AC = 3, and AB = 6, then what is the length 
of AD? (Source: AHSME) 


4.59 An observer walks along a level road directly towards an elevated object P. He takes observations at three 
points A, B, and C, in order, and notes that the angles of elevation of P are 0, 20, and 30, respectively. Prove that 
AB > 2BC. (Source: CEMC) Hints: 238, 133 


4.60 Point D is on BC such that ZBAD = ZCAD. Find AD in terms of a, b, and c, where a = BC, b = AC, and 
c= AB. 


4.61 Triangle ABC is isosceles with AB = AC and altitude AM = 11. Suppose that there is a point D on AM with 
AD = 10 and ZBDC = 3ZBAC. Find the perimeter of AABC. (Source: AIME) Hints: 110, 92 


4.62x Find sin 18°. (See if you can find both a geometric and an algebraic solution.) Hints: 157, 90 


4.63 Let a, b, c be the three sides of a triangle, and let a, p, y, respectively, be the angles opposite them. If 
a> +b? = 1989c%, find 
coty 
cota + cot Bp 


(Source: AIME) Hints: 159 


4.64 Show that if A, B, and C are the angles of a triangle, then tan 4 tan 2 + tan B tan Ç + tan g tan 4 =]. 


4.65x The axis of the Earth makes an angle of 66.5° with the plane 
in which it orbits the sun. The latitude of Vancouver is 49° N, which 
means that if we form an angle by connecting Vancouver to the center of 
the Earth, and then connecting the center of the Earth to a point on the 
equator with the same longitude as Vancouver, the angle has measure a 
49°. (This is LVOE at right.) a ote 


At right is a cross-section of the Earth that includes Vancouver and 
the Earth’s axis. Vancouver is point V, point E is on the equator, point 
O is the center of the Earth, S is the Sun, and VY is perpendicular to the 
Earth’s axis. The summer solstice is the day of the year with the most 
hours of sunlight in the Northern hemisphere. Find the number of hours 
of sunlight Vancouver receives on the summer solstice. Hints: 219, 27, 171 


4.66 Triangle ABC is a right triangle with AC = 7, BC = 24, and right angle at C. Point M is the midpoint of AB, 
and D is on the same side of line AB as C so that AD = BD = 15. Find the area of ACDM. (Source: AIME) 


4.67x A circle is inscribed in quadrilateral ABCD, tangent to AB at P and to CD at Q. Given that AP = 19, 
PB = 26, CQ = 37, and QD = 23, find the square of the radius of the circle. (Source: AIME) Hints: 78 
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4.68x Show that if A, B, and C are the angles of a triangle, then 


cos? A + cos? B + cos? C + 2cos A cos BcosC = 1. 
Hints: 68, 21 
4.69% . 


(a) Prove that if ABCD is a cyclic quadrilateral (a quadrilateral that can be inscribed in a circle), then the area of 
ABCD is v(s — a)(s — b)(s — c)(s — d), where a, b, c, and d are the side lengths of ABCD and s = (a +b + c + d)/2. 
(This result is known as Brahmagupta’s formula.) Hints: 260, 3 


(b)*x Prove that the area of any quadrilateral ABCD is 


(s — a)(s — b)(s — c)(s — d) — abcd cos? (4 > =) j 


(This result is known as Bretschneider’s formula.) Hints: 186, 169, 286 


4.70x In parallelogram ABCD, let O be the intersection of diagonals AC and BD. Angles CAB and DBC are each 
twice as large as angle DBA, and angle ACB is r times as large as angle AOB. Find r. (Source: AIME) Hints: 7 


4.71x Show that in AABC, we have 
BC? cos(B — C) + CA? cos(C — A) + AB? cos(A — B) = 3(BC)(CA)(AB). 
Hints: 265 
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CHAPTER 5. PARAMETERIZATION AND TRIGONOMETRIC COORDINATE SYSTEMS 


Mathematics is the supreme judge; from its decisions there is no appeal. — Tobias Dantzig 


CHAPTER cr 


Parameterization and Trigonometric Coordinate Systems 


5.1 Parameterization 


Imagine we wished to track the path of a particle traveling within the Cartesian plane. Suppose that we want to 
know more than just the path it takes, but we also want to know where it is at each moment in time. Then, saying 
that the particle travels along the graph of an equation like y = x + 2 won’t do—this doesn’t tell us where the 
particle is at each moment in time. Instead, we can use equations called parametric equations. 


Rather than having a single equation that relates x and y, we can define a relationship between x and y using an 
intermediary variable called a parameter. We do so by expressing x and y as functions of the parameter, meaning 
we relate x and y with equations of the form x = f(t), y = g(t), where t is the parameter. (We often use t as the 
variable for a parameter because we frequently use a parameter to represent time.) We refer to the equations 
x = f(t), y = g(t) as parametric equations. When we graph parametric equations x = f(t), y = g(t), we graph all 
points (x, y) such that there is a value of t for which x = f(t) and y = g(t). If the possible values of the parameter are 
not specified, then it is implied that the parameter can take on any real value for which the equations are defined. 


For example, we can use the parametric equations 


x=2-3t, 
y=1+2t, 


to represent the location of a particle that is at (x, y) = (2, 1) at time t = 0 and moves 3 to the left and up 2 for each 
unit of time. When t = 4 in these parametric equations, we have x = —10 and y = 9, so (x, y) = (-10,9) is on the 
graph of the parametric equations. If we let t = —1, we have (x, y) = (5,-1), so if we leave t unrestricted, then 
(x, y) = (5,-1) is on the graph. To restrict the graph to only those points the particle will pass through from the 
time it starts at (2,1), we would restrict the parameter to t > 0. 
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a > 


Problem 5.1: In this problem, we find the graph of the parametric equations x = 2 — 3t, y = 1 + 2t. 


(a) Choose several values of t and compute (x, y) for each value. Plot the resulting points. What does it look 
like the graph of all such points will be? 


(b) Eliminate ¢ from the parametric equations x = 2 — 3t, y = 1 + 2t to find an equation in which the only 
variables are x and y. Show that for every pair (x1, y1) that satisfies this new equation, we can find a value 
of f in the parametric equations that gives (x, y) = (x1, y1). 


Suppose we limit £ to the interval [4,8]. How does this affect the graph? 


Problem 5.2: Find the graph of the parametric equations x = 1 —t, y = 2f — t. 
Problem 5.3: Find the graph of the parametric equations x = 3 + 2cost, y=4+2sint for 0 < t < 27. 
Problem 5.4: Find the graph of the parametric equations x = sin 6, y = 3 — 2 cos 20. (Be careful!) 
Problem 5.5: 


(a) Find parametric equations for x and y such that the graph of the parametric equations is the same as the 
graph of x7 + y* = 1. 


(b) Is your answer to part (a) unique? Are there any other parametric equations that produce the same 


graph? 


(c) Suppose a particle starts at the point (0, —2), and moves at a constant speed clockwise around the origin 
along the graph of x*+ y? = 4. If the particle completes a full revolution in 4 seconds, then find parametric 
equations for the location of the particle in terms of t, where t is the time in seconds since the particle 
started moving. 


Problem 5.6: Find parametric equations for x and y such that x? - 34? = 1. 


Problem 5.7: I’m riding my bike at a constant speed of 20 miles per hour on flat land when I ride over a dot 
of fresh yellow paint, thereby getting a yellow spot on my front wheel, which is 2 feet in diameter. I continue 
riding my bike in a straight line at 20 miles per hour. When f seconds have passed since running over the dot, 
how many feet is the yellow spot on my tire from the yellow dot I ran over? 


| Problem 5.1: In this problem, we find the graph of the parametric equations x=2- 3t, y=1 ++ Di 
(a) Choose several values of t and compute (x, y) for each value. Plot the resulting points. What does it 
look like the graph of all such points will be? 


(b) Eliminate t from the parametric equations x = 2 — 3t, y = 1 + 2¢ to find an equation in which the only 
variables are x and y. Show that for every pair (xı, y1) that satisfies this new equation, we can find a 
value of t in the parametric equations that gives (x, y) = (x1, y1). 


| 
| 
| 
| 


Suppose we limit t to the interval [4,8]. How does this affect the graph? 


Solution for Problem 5.1: 


(a) Atleft atop the following page we have a table of values of t and corresponding values of x and y. At right, 
we have graphed several of the resulting points (x, y) on the Cartesian plane. 
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(b) 


(c) 


© x= 23) (y= i or 


—4 14 =) 
18) 11 =) 
A 8 =5 
=! 5 =I 
0 2 1 
1 =Í 3 
2 —4 5 
3 =7 7 


The points on the graph look collinear. It looks like the graph of the parametric equations is a line. 


We can confirm our guess from part (a) by eliminating the parameter from the parametric equations. 
Adding 2 times x = 2 — 3t to 3 times y = 1 + 2t gives 2x + 3y = 7. The graph of this equation is indeed a line. 


Concept: We can sometimes better understand a set of parametric equations by eliminating 
Q=» the parameter from the equations. | 


Unfortunately, this only tells us that every point in the graph of the parametric equations is in the graph 
of 2x + 3y = 7. In order to conclude that the graphs are the same, we must show that every point in the 
graph of 2x + 3y = 7 is in the graph of the parametric equation. Fortunately, that’s not too hard. Suppose 
that (x1, y1) is in the graph of 2x + 3y = 7, so we have 2x; + 3y; = 7. To show that (x1, y1) is in the graph of 
the parametric equations, we must show that there is a value of t that gives (x, y) = (x1, y1). To find that t, 
we solve the parametric equation x = 2 — 3t for t, which gives t = (2 — x)/3. So, we let t = (2 — x;)/3 in the 
parametric equations, and we have 


2-X 
me Lyi 
X 3 x4 

ne yh 
ae ae A data z 
EE A AE E N T OT ONE 


which means (x1, y1) is in the graph of the parametric equations. Therefore, every point in the graph of 
2x + 3y = 7 is in the graph of the parametric equations. We conclude that the graph of x = 2 — 3t, y= 1+ 2t 
is the same as the graph of 2x + 3y = 7. 


WARNING!! 
ry 


If parametric equations x = f(t), y = g(t) satisfy an equation in terms of | 
x and y, we only know that the graph of the parametric equations is part 
of the graph of the non-parametric equation. To show that the graph of 
the parametric equations is the same as the graph of the non-parametric 
equation, we must show that every point in the graph of the non-parametric 
equation is in the graph of the parametric equations. We can usually do so 
by showing that for each point (x1, y1) on the graph of the non-parametric 
equation, there is a value of t such that x; = f(t), y1 = Q(t). 


When t = 4, we have (x,y) = (—10,9), and when t = 8, we have (x, y) = (—22,17), so the graph of the 
parametric equations when t is restricted to the interval [4,8] is the line segment from (—10,9) to (—22, 17). 
To see that every point on this segment is on the graph, and no other points are, we can start from 4 < t < 8, 
multiply by —3 to produce —12 > —3t > —24, and then add 2 to get —10 > 2 — 3t 2 —22. Since x = 2 — 3t, 
we have -10 2 x 2 —22. So, every point on the line from part (b) for which —10 > x = —22 is produced by 
values of t in the interval [4,8], and no other values of x are possible. 


Problem 5.2: Find the graph of the parametric equations x = 1 — t, y = 2? — t. 
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Solution for Problem 5.2: We start as in Problem 5.1, choosing values of t and solving for x and y. 


t x=1-t y=28-t 
-3 4 21 
-2 3 10 
=l 2 3 
0 1 0 
1 0 1 
2 =! 6 
3 -2 15 


The ¢* in the parametric equation for y suggests the graph might be a parabola, and the dashed parabola in the 
diagram seems to fit the data nicely. To see that the graph is indeed a parabola, we try to relate x directly to y. We 
can’t use elimination, as we did in Problem 5.1, but substitution does the job. From x = 1 — t, we have t = 1 — x, so 


y = 2¢? —¢ = 2(1 — x)? —-(1— x) = 2? — 40 42-14% =2x7 -—3x +1. 


As in the previous problem, we still must show that every point in the graph of y = 2x? — 3x + 1 is in the graph 
of the parametric equations. For any point (xı, y1) in the graph of y = 2x? — 3x + 1, we have yı = De — 3x, +1. 
Solving the parametric equation x = 1 — t for t gives t = 1 — x. So, we let t = 1 — x; in the parametric equations, 


and we have 
6=—f—=1—(1-—x)) =X, 
y = 2 -t=2(1- 4} -(1- x1) = 2x¢ -3x, +1 = y1, 


which means that (x1, y1) is in the graph of the parametric equations, as well. Therefore, the graph of the parametric 
equations is the same as the graph of y = 2x* — 3x + 1, which is the dashed parabola shown above. 0 


| Problem 5.3: Find the graph of the parametric equations x = 3+2cost, y=4+2sint for 0 < t < 27. 


Solution for Problem 5.3: We choose values of t for which we can evaluate sine and cosine easily. 


x=3+2cost y=4+2sint 


x=3+2cost y=4+2sint 


1 4 
7/6 3+ 3 5 71/6 3-3 3 
n/3 4 4+ V3 4n/3 2 4-43 
m/2 3 6 3n/2 3 2 
27/3 D 4+v3 57/3 4 4-3 
57/6 3- v3 5 117/6 3+ v3 3 


It looks like the graph is the circle shown at right. We can’t use either the elimination tactic of Problem 5.1 
or the straightforward substitution tactic of Problem 5.2 to turn the parametric equations into a single equation 
with only x and y. So, we look for other strategies for eliminating the cost and sint terms, which brings us to the 


trigonometric identity sin? t + cos?t = 1. 


Concept: Trig identities can be very useful in working with parametric equations. | 


Solving x = 3+2 cost and y = 4+2 sin t for cost and sin t give cost = (x—3)/2 and sin t = (y—4)/2. Substituting 
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these into sin’ t + cos? t = 1 gives 


Multiplying both sides by 2? gives 
(x -3)? + (y—-4)* = 2?. 


The graph of this equation is a circle with center (3, 4) and radius 2, as expected, and as depicted above. 


We're not finished yet! We have only shown that every point in the graph of the parametric equations is on 
the circle. We still have to show that every point on the circle is in the graph of the parametric equations. 


We know one circle that is related to cosine and sine—we use the unit circle to define cosine and sine. From 
this definition, the unit circle is the graph of the parametric equations x = cost, y = sint. Multiplying the right 
sides of these equations simply scales every point away from the origin by a factor of 2, so the graph of x = 2 cost, 
y = 2sint is a circle centered at the origin with radius 2. We produce the graph of x = 3+ 2cost, y = 4 + 2sint 
by translating each point on the graph of x = 2cost, y = 2sint to the right 3 and up 4. Therefore, the graph of 
x=3+2cost, y=4+2sint is the full circle centered at (3,4) with radius 2. O 


Problem 5.4: Find the graph of the parametric equations x = sin 0, y = 3 — 2 cos 20. (Be careful!) 


Solution for Problem 5.4: We can write cos 20 in terms of sin@, so we can easily eliminate the parameter by 
substituting x for sin 0 in the resulting expression: 


y =3-2cos20 =3-2(1 —2sin* 0) = 3 - 2(1 — 2x’) = 4x7 + 1. 
What’s wrong with this finish: 


Bogus Solution: Therefore, the graph of the parametric equations is the parabola thatresults 


from graphing the quadratic y = 4x? + 1. 


We see the problem when we look back at the parametric equations. We have x = sin 0, so x 
cannot be greater than 1 or less than —1. There is an angle @ such that sin 0 = x for each x from 
—1 to 1. Therefore, the graph of the parametric equations is the graph of y = 4x? + 1 for which 
-1 < x < 1, not the entire parabola that is produced by graphing y = 4x* + 1 without restriction 
on O 


In Problem 5.4, we found that the parametric equations x = sin 0, y = 3 — 2cos 20 satisfy the 
equation y = 4x? + 1. However, the graph of the parametric equations is not the same as the graph 
of y = 4x? + 1. The fact that the parametric equations x = sin 0, y = 3 — 2 cos 20 satisfy y = 4x? +1 
only tells us that the graph of the parametric equations is part of the graph of y = 4x? + 1. We then must consider 
the restrictions of the parametric equations to determine what portion of the graph of y = 4x? + 1 is the graph of 
the parametric equations. 


WARNING!! As noted earlier, if parametric equations x = f(t), y = g(t) satisfy an equation 

“S in terms of x and y, then we only know that the graph of the parametric 
equations is part of the graph of the non-parametric equation. Sometimes the | 
parametric equations have restrictions that limit the graph of the parametric 


equations to only a portion of the graph of the non-parametric equation. 


Our first few examples have given us a sense of how parametric equations work, but not much sense for what 
parametric equations are good for. Unfortunately, you'll have to wait until your study of calculus and physics to 
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put parametric equations to heavy use, but Problem 5.3 gives some hint as to why parametric equations can be 
useful. In that problem, we showed that the equation (x — 3)* + (y — 4)* = 2? can be modeled with the parametric 
equations x = 3+2cost, y=4+2sint. We cannot write the equation (x — 3} + (y — 4)? = 2? such that one of x or y 
is a function of the other. However, our parametric equations allow us to write both x and y as functions of some 
other variable. The ability to express x and/or y as functions of another variable becomes particularly important 
in calculus, which offers very powerful tools that we can apply to functions. 


Problem 5.5: 
(a) Find parametric equations for x and y such that the graph of the parametric equations is the same as the 
graph of x? + y* = 1. 


(b) Is your answer to part (a) unique? Are there any other parametric equations that produce the same 
graph? 


(c) Suppose a particle starts at the point (0, —2), and moves at a constant speed clockwise around the origin 
along the graph of x*+y? = 4. If the particle completes a full revolution in 4 seconds, then find parametric 
equations for the location of the particle in terms of t, where t is the time in seconds since the particle 
started moving. 


Solution for Problem 5.5: 


(a) The graph of x? + 4? = 1 is the unit circle. Any point on the unit circle is the terminal point of some angle 
0. We defined the functions cos 0 and sin @ such that the coordinates of this terminal point are x = cos 0, 
y = sin 0. Therefore, we say that x = cos 0, y = sin 0 is a parameterization of x? + y? = 1 because the graph 
of the parametric equations x = cos 0, y = sin @ is the same as the graph of x? + y7 = 1. 


(b) Our answer in part (a) isn’t the only parameterization of x? + y? = 1. We also could have let x = cos(-8), 
y = sin(-@). Then, we still have x? + y? = 1, and the parameterization produces every point on the 
circle. What’s the difference between the two parameterizations x = cos 0, y = sin@ and x = cos(-0), 
y = sin(—@)? Both parameterizations produce the same curve, but if we care about how the curve is traced 
out as we vary 9, then there is a difference between these two parameterizations. As 0 goes from 0 to 
27, the curve described by x = cos 6, y = sin 0 starts at (1,0) and is produced by moving along the unit 
circle counterclockwise about the origin. On the other hand, as 0 goes from 0 to 27, the curve described by 

= cos(—8), y = sin(—9) starts at (1,0) and is generated clockwise from the origin. 


There are infinitely many other ways we could have chosen the parametric equations. The equations 
x = sin2t, y = cos 2t work, as do x = —cosmt, y = -sin nt and x = sin(f? — t), y = —cos(f — f°). While it 
doesn’t matter which of these parameterizations we use to describe the graph of x” + y? = 1, it can be very 
important which parameterization we choose when using parametric equations to model a process. In our 
next part, we'll see why. 


(c) The first difference we see between this and part (a) is that the circle has radius 2 instead of radius 1. So, 
we might think to use the parameterization x = 2cost, y = 2sint. But, as described in part (b), a particle 
that follows the path given by x = 2 cost, y = 2sint will go counterclockwise, not clockwise. We saw in part 
(b) how to deal with that wrinkle. A particle that follows the path given by x = 2cos(-#), y = 2sin(-t) 
will travel clockwise as t increases. However, when t = 0, this parameterization gives (x, y) = (2,0), and 
we want the particle to start at (0, —2). We take care of that with a phase shift, using the parameterization 
x = 2cos (-t = z), y =2sin (-t- z), 

We're not finished yet! We still have to worry about how fast the particle travels around the circle. The 
particle makes a full revolution in 4 seconds, so it must be back at (0, —2) after 4 seconds. Therefore, we 
need the periods of the trig functions we use in our parameterization to be 4. Since the period of a function 
of the form sin at or cos at is a (for a + 0), we must have |a| = 5. Therefore, a parameterization for the path 
of the particle is x = 2 cos (-3 = z), y = 2sin (-3 = z), 


This isn’t the only way we can express this parameterization. We can use trigonometric identities to 


—_——— Oe OT 
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simplify it. Since cos (Ə = z) = sin 0 and sin (o = z) = —cos 0, we can express our parameterization as 
x = 2sin (-2 e y = —2cos (-3 I Also, noting that sin(—@) = — sin @ and cos(—@) = cos 0, we can express 
the parameterization as x = —2 sin (zt), y = -2 cos (zi). 


We can check this answer by noting that when t = 0, we have (x, y) = (0, —2), as expected. When t = 1, 
the particle has traveled one quarter-circle clockwise about the origin, and be at (—2,0). Sure enough, our 
parameterization gives (x, y) = (—2,0) when ł = 1. 


E 


We say that we parameterize an equation when we find parametric equations for the variables in the equation. 
So, for example, we found that we can parameterize x? + y* = 1 as x = cos 0, y = sin 0. However, we also saw that 
this is not the only way to parameterize x? + 4? = 1. 


Concept: There are usually many ways to parameterize a given equation. 


Problem 5.6: Parameterize the equation x? — 3y? = 1. 
Solution for Problem 5.6: The graph of x? — 3y* = 1 is a hyperbola. What’s wrong with this solution: 


Bogus Solution: We add 3y? to both sides and take the square root. This gives us the 
equation x = + ,/1 + 347. So, the graph of x = + V1 + 3#, y = t is the same 


as the graph of x? - 3y* = 1. 


The problem here is that + V1 + 3¢* is not a function. We can say that the graph of x = V1 +3¢, y = t together 


with the graph of x = —V1 + 32, y = t gives us the graph of x? — 3y = 1, but we still do not have a single pair of 
functions f(t) and g(t) such that the graph of x = f(t), y = g(t) is the graph of x? — 3y? = 1. 


We’ve already seen a couple of examples in which trig identities helped us relate parametric equations to an 
equation in which x and/or y are squared, so we look for a way for apply trig identities here. Specifically, we 
look for identities in which we have a difference of squares of trig functions, since the equation x? — 3y* = 1 has a 


difference of squares of variable terms x and V3y. We might consider the trig identity 
cos? 0 — sin? 0 = cos 20, 


but that isn’t promising for a couple of reasons. First, there’s the cos 29 on the right—what will we do with that? 
Also, we figure that x probably can’t be some constant times cos 0 or sin 0, since there are solutions to the equation 
x? — 3y? = 1 in which x and y are arbitrarily large numbers. Back to the drawing board. 


An identity that involves squares of trig functions besides sine and cosine is tan? O + 1 = sec? @. Rearranging 
this gives sec? 0 — tan? @ = 1. Aha! If we let x? = sec? 0 and 3y? = tan? 0, then x? — 3y? becomes sec? 0 — tan? 0, 
which we know equals 1. So, we can let x = sec 0 and y = 5 tan 0 be the desired parametric equations. (We could 
have taken the negative square root in either equation and produced valid parametric equations, too.) 


But why can we “take the square root” here to get a valid parameterization, but we couldn’t do so in the Bogus 
Solution? We rejected the result of Bogus Solution, x = + V1+ 3, y = t because + V1 + 3¢? is not a function. But 


what if took the positive square root and wrote the parameterization as x = V1 + 3¢, y = t? The problem here is 
that this parameterization only produces points for which x > 0. All points with x < 0 that are on the graph of 


x? — 347 = 1, such as (-1,0), are excluded from the graph of x = V1 + 3#, y =t. 
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With this in mind, we check to make sure that the graph of x = secO, y = an tan 6 does capture the full 


hyperbola that is the graph of x° — 3y* = 1. As 0 ranges from —5 to 5, we produce the branch of the hyperbola 
for which x is positive, where we produce the upper half of this branch when 0 < 0 < ž and the lower half when 
-3 < 0 <0. Then, as 0 ranges from % to 32, we produce the branch of the hyperbola for which x is negative. So, 
the entire hyperbola is produced by graphing x = sec Ø, y = $ tan 0. O 

Problem 5.7: I’m riding my bike at a constant speed of 20 miles per | hour on flat land when I ride over a dot 
of fresh yellow paint, thereby getting a yellow spot on my front wheel, which is 2 feet in diameter. I continue 
riding my bike in a straight line at 20 miles per hour. When t seconds have passed since running over the dot, 
how many feet is the yellow spot on my tire from the yellow dot I ran over? 


Solution for Problem 5.7: What's wrong with this solution: 


Bogus Solution: Iride 20 miles per hour for a time t, so the spot on the tire is now 20t away | 
aig from the dot on the road. 


There are several mistakes here. First, t is in seconds, my speed is in miles per hour, and we want the answer 
in feet. So, we have to convert 20 miles per hour to feet per second. Since there are 5280 feet ina mile, 20 miles is 
20 - 5280 feet. There are 60 minutes in an hour and 60 seconds in each minute, so there are 60 - 60 = 3600 seconds 
in an hour. Therefore, 20 - 5280 feet per hour is 20 - 5280/3600 feet per second, which is approximately 29.33 feet 
per second. 


The more interesting mistake comes from the fact that the answer is not simply 29.33t feet. The bike has moved 
29.33t feet forward, because it has moved in a straight line at 29.33 feet per second for t seconds. However, the 
spot has not moved in a straight line. Sometimes it is touching the ground, sometimes it is at the top of the wheel, 
and most of the time it is somewhere in-between. The spot doesn’t move ina straight line—as the center of the 
wheel moves forward, the spot rotates around it at a constant rate. 


To figure out where the spot is after t seconds, we have to take both the forward motion of the bike and the 
rotary motion of the wheel into account. We’ll do so by considering the horizontal motion of the spot and the 
vertical motion of the spot separately. We let x(t) be the distance forward the spot has moved in t seconds, and 
y(t) be how far off the ground the spot is after t seconds. In other words, we will find parametric equations for the 
location of the spot. Then, the distance from the starting point of the spot to its location after t seconds is simply 


V (x(t)? + (yb). 


First, we tackle the horizontal movement. The center of the wheel moves forward at a speed of approximately 
29.33t feet per second. However, the spot has not always moved the same distance forward as the center of the 
wheel has. Sometimes it’s a little ahead of the center of the wheel, and sometimes it’s a little behind the center of 
the wheel. To figure out exactly where the spot is relative to the center of the wheel, we have to figure out how 
much the spot has rotated around the center of the wheel. So, we have to figure out how much the wheel has 
turned. 


For the moment, we'll forget about the fact that the wheel is moving forward and focus on the rotation, so we 
can figure out where the spot is relative to the center of the wheel at time t. The spot starts directly below the 
center of the wheel, and then turns an angle 0. We can figure out where the spot is relative to the center in much 
the same way we tackled the Ferris wheel problem on page 62. We think of the wheel as the unit circle. Moreover, 
the radius of the bike wheel is 1 foot, so, because we are using feet as our units, the wheel really is a unit circle. 
AA 

Extra! The path the spot follows in Problem 5.7 is called a cycloid. Galileo gave the curve its name, and 
wiii mathematicians of the 17th century argued so much about properties of the curve that it became 
known as the “Helen of Geometers.” Source: “Cycloid” by E. Weisstein 


165 


CHAPTER 5. PARAMETERIZATION AND TRIGONOMETRIC COORDINATE SYSTEMS 


The spot starts at the “bottom” of the circle, corresponding to an angle of 3, 
We let the wheel move in the “positive” direction, as shown, so that the wheel 
rotates clockwise. Suppose the spot rotates clockwise an angle 0 about the center, 
as suggested by the grey spots and arrows. We can think of the spot as the terminal 
point of the angle 3# — 0. So, the spot’s horizontal displacement from the wheel’s 


center is cos (3% - ) feet and its vertical displacement from the center is sin (= - ø) 


feet. To find x(t) and y(t), we must combine the horizontal displacement relative to 
the center of the wheel with the fact that the wheel moves forward 29.33t feet, and 
we note that the center of the wheel is 1 foot off the ground. So, we have 


x(t) x 29.33t + cos (= = 0) ; 


y(t) =1+ sin (= -0). 


Uh-oh. What do we do about 0? We have to express 0 in terms of t, too. Fortunately, that’s not too hard to do. 
When the wheel turns a full rotation of 27, the bike moves forward by the circumference of the wheel, which is 
2n feet, since the diameter of the wheel is 2 feet. So, the number of radians the wheel turns equals the distance the 
bike moves forwards in feet (this is a convenient coincidence caused by the fact that the wheel has radius 1 foot). 
Therefore, in the t seconds that the bike moves forward 29.33t feet, the wheel turns an angle 0 = 29.33t radians. 
So, the parametric equations for the position of the spot are 


a 
Direction of bike 


3 
x(t) = 29.33t + cos (5 = 29.33) : 


y(t) 1+ sin (= = 29.331) : 
Applying angle sum identities for sine and cosine, we have 


x(t) x 29.33t — sin 29.33t, 
y(t) = 1 — cos 29.33t. 


Therefore, the distance from the spot on the wheel to the dot on the road is 


/ (x(t)? + (y(b)2 = y/(29.33t — sin 29.334)? + (1 — cos 29.33t)? 
~ \/ 860.2512 — 58.66t sin 29.33t + sin? 29.33t + 1 — 2 cos 29.33t + cos? 29.33¢ 
= V860.25? — 58.66t sin 29.33t — 2 cos 29.33t + 2. 


E 


= + 
5.1.1 What is the slope of the line that is the graph of x = 3t +7, y = 8t — 5? 
5.1.2 Find the graph of the parametric equations x = 3 cos 2t, y = —3 sin 2t. 
5.1.3 Find the graph of the parametric equations x = -3 + 5cos 6, y= 2 + 5sin 0. 
5.1.4 Find the graph of the parametric equations x = 4cos 2t, y = 1 + cos? 2t. 
5.1.5 Find parametric equations for x and y such that 4x? + 9y* = 144. 


5.1.6 A particle starts at (1,5) and travels at a constant speed in a circle around the point (4,5). Find para- 
metric equations to describe the location of the particle t seconds after it starts moving if the particle travels 
counterclockwise and takes 87 seconds to complete a full revolution around the circle. 
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5.1.7x What is the graph of the parametric equations x = ear SA = ? Hints: 50, 128 


5.2 Polar Coordinates 


When we plot points on the Cartesian plane, we typically identify the location of each point with coordinates that 
denote where the point is horizontally and vertically with respect to a point we call the origin. These are the x- 
and y-coordinates that you're already used to; we sometimes refer to these as rectangular coordinates. While this 
is the most common method we use to identify the location of a point on a plane, it is not the only method. 


Polar coordinates offer another method for denoting the location of 90° 

a point on a plane. As with rectangular coordinates, we start with a 120° 60° 
point we call the origin and we identify the location of each point with an F 
ordered pair, (r, 0). The r-coordinate is the distance from the point to the 
origin; we call this coordinate the radial coordinate. The 0-coordinate 
is the angular coordinate, which we define in the same way we related 
angles to points on the unit circle. A point with 0 = 0° is directly to 
the right of the origin, a point with @ = 90° is directly above the origin, 
a point with @ = 180° is directly to the left of the origin, and so on, as 
depicted at right. For example, point A at right has polar coordinates 
(3,30°) and point B has polar coordinates (2, 240°). 


While each point only has one possible representation in rectangular 240° 300° 
coordinates, each point has multiple representations in polar coordinates. 270° 
For example, we can add 360° to the angle, or subtract 360° from it. So, the polar coordinates of A could be written 
(3, 390°) or (3, 750°). We can also use a negative radius, which means “go in the direction opposite the one indicated 
by 0.” For example, we could express A as (—3, 210°), which tells us that A is 3 units in the direction opposite 
210°, which is the direction of 30°. Similarly, the polar coordinates (r, 0) and (—r, 6 + 180°) always refer to the same 
point. We can also use a negative angle, so point B above could be expressed as (2, -120°). 


Because we write both rectangular and polar coordinates as ordered pairs, it’s 
easy to get confused about which we mean when we write a specific ordered 
pair. Ordered pairs of coordinates usually refer to rectangular coordinates, 
so if you're using polar coordinates, you should be very clear that you are 
doing so. 


| __Probiems ils 


Problem 5.8: Convert each of the following rectangular coordinates to polar coordinates. 


(a) (0,-3) (c) (-2 V3, -6) 
(b) (5,5) 


WARNING!! 
bA 


Problem 5.9: Convert each of the following polar coordinates to rectangular coordinates. 
(a) (8,60°) (c) (4V2,315°) 
(b) 
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Problem 5.10: 
(a) What are the rectangular coordinates of the point with polar coordinates (1, 0)? 


(b) What are the rectangular coordinates of the point with polar coordinates (r, 0)? (You can assume r > 0.) 


(c) Suppose point P is (x,y) in rectangular coordinates and (r,9) in polar coordinates, where r > 0. Find r 
in terms of x and y. How must we modify our answer if we remove the restriction r > 0? How is tan 0 
related to x and y? 


Problem 5.11: When we graph an equation that has the variables x and y, the graph consists of all points (x, y) 
on the Cartesian plane that satisfy the equation. Similarly, the graph of an equation that has the variables r and 
0 consists of all points with polar coordinates (r, @) that satisfy the equation. 

(a) What is the graph of r = 5? 
(b) What is the graph of 0 = 20°? 


Problem 5.12: 
(a) Graph the equation r = 6sin 6. Use your graph to guess the rectangular form of this equation. 


(b) Prove that r = 6sin @ is equivalent to the rectangular form you find in part (a). 


Problem 5.13: In rectangular form, the general form of an equation whose graph is the circle with center (h, k) 
and radius t is 


(x-h} + (y-k? =P. 


In this problem, we find the polar form of this equation. That is, we find the general form of an equation whose 
graph is the circle with center (ro, œ) (in polar coordinates) and radius t. 


(a) Express h and k in terms of ro and a. 


(b) Show that the polar form of the given rectangular form is 1? — 2rro cos(0 — a) + 73 =P. 


Problem 5.14: > 
(a) What are the rectangular coordinates of the point that results when (5 ¥2,5 V6) is rotated an counter- 
clockwise about the origin? Hints: 48 


(b) Show that if the point (x, y) is rotated @ counterclockwise about the origin, the result has rectangular 
coordinates 


(x cos 0 — ysin 0, y cos 8 + xsin 0). 


Problem 5.15: Let i and k be constants, and let a and b be positive constants witha > b. The graph of an equation 
of the form 
wame O o 
Era p? 
is an ellipse whose major axis has length 2a. Every ellipse in the Cartesian plane with a horizontal major axis is 
the graph of an equation of this form. 


The graph of the equation 7x? — 6 V3xy + 13y? = 64 is also an ellipse. Find the length of its major axis. 
Hints: 246 


Problem 5.8: Convert each of the following rectangular coordinates to polar coordinates. 


(a) (0,-3) (b) (5,5) (c) (-2 V3,-6) 
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Solution for Problem 5.8: 


(a) The point (0, —3) is 3 units directly below the origin. Because the point is 3 units from the origin, the point 
has r = 3 in polar coordinates. Since the point is directly below the origin, the point has 6 = 270°. So, 
the polar coordinates of the point are (3, 270°). As noted in the introduction to this section, there are many 
other ways we can express the polar coordinates of the point, such as (3, —90°), (-3, 90°), or (3, 630°). Make 
sure you see why each of these describes the same point as (3, 270°). 


(b) The point (5,5) is point P at right. The rectangular coordinates give us PA = AO = 5, so 


APAO is an isosceles right triangle. Therefore, we have PO = 5 V2 and ZPOA = 45°, and 
the polar coordinates of point P are (5 V2, 45°). 


polar coordinates, we note that QO = ,/QA2 + AO? = 4 V3. Because the hypotenuse of 
AQAO is double one of the legs, we recognize that AQAO is a 30-60-90 right triangle. Since 
ZQOA is opposite the longer leg, we have ZQOA = 60°. Point Q is 60° counterclockwise 
from the negative x-axis, so its angle in polar coordinates is 180° + 60° = 240°. Therefore, 
point Q in polar coordinates is (4 V3, 240°). 


y 
| P 
: 
O A 
(c) Point Q at right is (-2 V3, -6), so we have AO = 2 V3 and QA = 6. To find the radius in A y 
| n 
Q 


Problem 5.9: Convert each of the following polar coordinates to rectangular coordinates. 


(a) (8,60°) b) (12-4) (c) (4-V2,315°) 


Solution for Problem 5.9: 


(a) Let P be the point with polar coordinates (8, 60°), so that PO = 8 and ZPOA = 60° in the 
diagram at right. From 30-60-90 triangle POA, we have AO = *2 = 4 and PA = AO V3 = 
4 V3, so the rectangular coordinates of point P are (4,4 V3). 


(b) The point Q with polar coordinates (ea -22 is % clockwise from the positive x-axis. 4 y 
Therefore, Q is in the third quadrant, as shown at right. Drawing altitude QA produces 


right triangle QOA with hypotenuse QO = 12, from which we find QA = 6 V3 and 
OA = 6. Since Q is in the third quadrant, both rectangular coordinates are negative, so 


the rectangular coordinates of Q are (—6, —6 v3). 


Q 
(c) A point with angle 315° in polar coordinates is in the fourth quadrant, 45° clockwise Y 
from the positive x-axis as shown at right. From 45-45-90 triangle PAO, we find 
PA = AO = 4, so the rectangular coordinates of P are (4, —4). (Make sure youseewhy O A x 
the y-coordinate is negative.) = 
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Problem 5.10: 
(a) What are the rectangular coordinates of the point with polar coordinates (1, 0)? 


(b) What are the rectangular coordinates of the point with polar coordinates (r, 0)? (You can assume r > 0.) 


(c) Suppose point P is (x, y) in rectangular coordinates and (r, 0) in polar coordinates, where r > 0. Find r 
in terms of x and y. How must we modify our answer if we remove the restriction r > 0? How is tan 0 
related to x and y? 


Solution for Problem 5.10: 


(a) The point (1, 0) is one unit from the origin, so it is on the unit circle. This point results when we rotate the 
point (1, 0) an angle of 6 counterclockwise about the origin, which means that (1, 0) are the polar coordinates 
of the terminal point of O. We used this point to define cos 0 and sin 0! Specifically, by our definitions of 
sine and cosine, the rectangular coordinates of the terminal point of 0 are (cos 0, sin 0). 


(b) We can use our result from part (a) to quickly convert the polar coordinates (r, 0) 

to rectangular coordinates. In the diagram at right, point Q has polar coordinates 
(1, 8), so itis on the unit circle. Point P has polar coordinates (r, @), so line ze) passes 
through the origin, as shown. Drawing altitudes from P and Q to the x-axis gives 
similar triangles PAO and QBO as shown. Because these triangles are similar, we 
have 

PA _ AO _ PO 

QB BO QO 


From the polar coordinates, we have PO = r and QO = 1, so = ae gives us PA = r- QB, and ag = os 


gives us AO = r- BO. Therefore, the distance from P to each axis is r times the corresponding distances 
from Q to each axis. Combining this with the fact that P is in the same quadrant as Q, we see that the 
rectangular coordinates of P are r times the rectangular coordinates of Q. So, the rectangular coordinates of 
P are (r cos 0, r sin 0). 


We can check this result by using it to confirm our answers to Problem 5.9. Converting (8, 60°) to 
rectangular coordinates gives (8 cos 60°, 8 sin60°), which is (4,4 V3). Converting (12, — 22) to rectangular 
coordinates gives (12 cos (- ar) , 12 sin (- 2r) ) , which is (—6, —6 v3). Converting (4 V2, 315°) to rectangular 


coordinates gives (4 V2 cos 315°, 4 V2 sin 315°), which is (4, —4). All three results match the ones we found 
earlier. 


(c) The distance from (x,y) in rectangular coordinates to the origin is 4/x? + y2. Therefore, when (x,y) is 
converted to polar coordinates such that r > 0, we have r = ,/x? + 42. If we remove the restriction that 
r > 0, then we must instead write this relationship as 77 = x? + 4. 


From the previous part, we have x = r cos 0 and y = rsin 0. If x + 0, we can divide the latter equation 
by the former to get #82 = 


= ee tan@ = z, If we restrict r to be nonnegative, then we cannot deduce 


from tan@ = # that 0 = arctan“, because the range of arctangent is (-3, z). For example, if we have 
(x,y) = (-3,-3), then @ = arctan gives us 0 = 7, but the angle when we express (—3,-3) in polar 
coordinates with r > 0 is 3! radians, which is not the same angle as % radians. If we allow r to be negative, 
then we can let 6 = arctan z, but then we must use the quadrant of (x, y) to determine whether r is positive 


or negative. 


The best we can do to express @ simply in terms of x and y is to write tan 0 = 4 (for x # 0). If x # 0, there 
will be two values of O with 0° < @ < 360° that satisfy this equation. We choose whichever one corresponds 
to the quadrant of (x, y). 
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Important: If (x, y) in rectangular coordinates is the same point as (r, 0) in polar coordinates, 


then 
x=rcos0, y=rsin06, fr =x +3, tang = 2, 


where tan@ = 4 holds only if x + 0. If x = 0 and y # 0, then cos@ = 0 and 
r = 4°. If x = y =0, then r = 0 and 0 can be anything. 


There’s no need to memorize these relationships; you can “see” them whenever you IE 
need them by visualizing the diagram at right, which depicts the relationship between a first 
quadrant point P and the origin O using both rectangular and polar coordinates. All of the r y 
equations in the box above can be read directly from the diagram. 

We can use these relationships to give an algebraic explanation why the polar coordinates O x = 


(—r, 0+180°) give the same point as (r, 0). Converting (—r, 0+ 180°) to rectangular coordinates, 
we have 


x = (-r) cos(@ + 180°) = (—r)(— cos 0) = rcos 0, 
y = (-r) sin(0 + 180°) = (—r)(—sin 0) = r sin 0, 
which are the rectangular coordinates of (r, 0) as well. 


Now that we understand how to locate points in polar coordinates, let’s explore graphing equations in polar 
coordinates. When we graph an equation that has the variables x and y, the graph consists of all points (x, y) on 
the Cartesian plane that satisfy the equation. Similarly, the graph of an equation that has the variables r and 0 
consists of all points with polar coordinates (r, @) that satisfy the equation. 


Problem 5.11: 
| (a) What is the graph of r = 5? 


(b) What is the graph of 0 = 20°? 


Solution for Problem 5.11: 


(a) The graph of r = 5 consists of all points that are 5 units away from the origin. Therefore, the graph is the 
circle with radius 5 centered at the origin. 


90° 90° 


160" alas ape 
y or, ya j 
210° \ SP ale- < / 330° 
300° 
270° 270° 
Figure 5.1: Graph of r = 5 Figure 5.2: Graph of 0 = 20° 


(b) Any point (r, 0) with 0 = 20° and r > 0 in polar coordinates must be on the ray from the origin that makes 
a 20° angle with the positive x-axis such that the ray is in the first quadrant. Conversely, any point on this 
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ray can be expressed in polar coordinates with an angle of 20°. We must also include points for which 
the radius is 0 (the origin) or negative. If the radius r is negative, then the point (r,20°) is in the direction 
opposite the aforementioned ray. In other words, when r is negative, the point (r, 20°) is on the line formed 
by continuing this ray past the origin into the third quadrant, as shown at right above. Every point on this 
ray in the third quadrant can be written as (s, 200°) for some positive value of s, and such a point can also 
be written as (—s, 200° — 180°), or (—s,20°). Therefore, the graph of 0 = 20° is a line through the origin that 
passes through the first and third quadrants and makes an angle of 20° with the x-axis. 


Having tackled the simplest equations, those in which r or @ is constant, let’s look at an equation that includes 
both r and @. 


| Problem 5.12: 
(a) Graph the equation r = 6sin 0. Use your graph to guess the rectangular form of this equation. 


(b) Prove that r = 6sin @ is equivalent to the rectangular form you find in part (a). 


Solution for Problem 5.12: 
(a) We get a feel for the graph by finding r for several values of 8. When @ = 0, we have r = 0, so the graph 


passes through the origin. When 0° < @ < 180°, we have r > 0, so there are points in the first and second 
quadrants in the graph, corresponding to 0° < @ < 90° and 90° < @ < 180°, respectively. The largest value 
of r occurs when sin @ = 1, which is when @ = 90°. This gives r = 6, so the point 6 units directly above 
the origin is on the graph. Since 6 sin @ increases from 0 to 6 as @ goes from 0° to 90°, and then goes back 
down from 6 to 0 as @ goes from 90° to 180°, the graph goes from the origin through the first quadrant to 
the point 6 above the origin ((0, 6) in rectangular coordinates, (6,90°) in polar coordinates), and then back 
down through the second quadrant to the origin. 


When 180° < @ < 360°, the value of 6sin @ is negative. As we pick a few values of @ in this range, we 
notice that each produces a point we already found when examining 0° < @ < 180°, since a negative radius 
means the point is in the direction opposite 0. For example, if @ = 210°, we have r = 6sin@ = —3, which 
gives us the point (—3, 210°). But this point is the same as (3, 210° — 180°), which is (3,30°), a point we already 
know is on the graph. Similarly, since sin(a + 180°) = — sina, we can write the point (6 sin(a + 180°), a+ 180°) 
as (-6sina,a@ + 180°), which is the same as (6sina, a). Therefore, we only have to worry about the angles 
0° < 0 < 180° when constructing the graph of r = 6 sin 0. Our table and the corresponding points are below. 


o (r, 8) 90° 
(0,0°) 60° 
30° | (3,30°) 
45° | (3-V2,45°) SON 30° 
60° | (3 V3, 60°) A 
90° | (6,90°) Dean \ | 
120° | (3 V3, 120°) 180° PUSTET TIN 
185° V (3 V2,135°) aa 8: 
150° | (3,150°) ! a 
210° 330° 
300° 


270° 


These points suggest that our graph is the circle with center (0,3) and radius 3. We know that in rectangular 


coordinates, an equation of the form 
(x-h?+(y-ke =P 
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has a graph that is a circle with center (h, k) and radius t, so we guess that the graph of the equationr = 6 sin 8 
is the same as the graph of the equation x? + (y — 3)? = 3°. 


(b) Now that we have some clue what we’re looking for, we know just how to look for it. We wish to show 
that the graphs of r = 6sin@ and x° + (y — 3)? = 3? are the same. We start by looking at the equations we 
found in Problem 5.10 to relate polar and rectangular coordinates: 


x=roos@, y=rsn0, P=x+y, tano=. 
2 


Our goal is to use these relationships to convert r = 6sin@ to x? + (y — 3)? = 3? (or vice versa). First, we 

note that the origin is in both graphs. For all other points, we have r > 0, so we don’t have to worry about 

inadvertently multiplying or dividing by 0 when we multiply or divide an equation by r. Here are a couple 

wee that ue on convert the polar coordinate equation r = 6sin 0 to the rectangular coordinate equation 
da (=O) eee 


Method 1: Substitution for sin 0. We write y = rsin 0 as sin@ = 4. Substituting this into r = 6sin @ gives 


r= oy Multiplying both sides by r gives r? = 6y. Aha! We have an equation for 7° in terms of x and y. 
Substituting 7? = x° + y? gives x? + y? = 6y. Rearranging this gives x? + y* — 6y = 0, and completing the 
square in y gives x” + (y — 3)? = 3?. 

Method 2: Manipulation to introduce r° and r sin 8. We know that we can convert r sin 8 into y, and convert 
r? into x? + y*, so we multiply both sides of r = 6sin@ by r to give 7? = 6rsin@. We can now write this 
equation as x* + y* = 6y, and finish as before. The key step in this second approach is trying to force the 
expressions r sin @ and 7° into the equation, since we can then eliminate the polar variables by substituting 
the equivalent rectangular variable expressions. 


We also could have started with the rectangular coordinate equation x” + (y — 3)? = 3? and converted it 
to the polar coordinate equation r = 6sin 9. We start by expanding and rearranging x? + (y — 3)* = 3? to get 
x? + y? = 6y. Then, for any point (x, y), we can let r? = x? + y* and y = rsin@, and we have r° = 6rsin 0. 
Dividing 7? = 6r sin 0 by r gives r = 6sin 0. 

We conclude that the graph of r = 6sin @ is the same as the graph of x? + (y — 3)? = 3?, which is a circle 
with radius 3 centered at the point with rectangular coordinates (0,3). 


Problem 5.13: In rectangular form, the general form of an equation whose graph is the circle with center (h, k) 
and radius t is 


(x—h)? + (y - k? =f; 


Find the polar form of this equation. That is, find the general form of an equation whose graph is the circle 
with center (ro, a) (in polar coordinates) and radius £. 


Solution for Problem 5.13: Our goal is to turn the equation in terms of x, y, h, and k into an equation in terms of 
r, 0, ro, and a. Again, our equations from Problem 5.10 are the key. Here, the conversion is a bit easier than in 
Problem 5.12. Since we are given the rectangular form, we can simply substitute x = r cos 0 and y = rsin@ to 
replace the rectangular variables (x, y) with the polar variables (r, 0). This gives us 


(rcos0 -h° +(rsind—k? =P. 


However, we still have to get rid of h and k—these are rectangular coordinates as well. Just as we can use 
= rcos@ and y = rsin to convert the variables (x, y) to the variables (r,@), we can use them to express the 


center’s rectangular coordinates (h, k) in terms of its polar coordinates (ro, œ). We therefore have h = ro cosa and 
k = ro sina, and the equation of our circle is 


(rcos @ — rocos a)” + (rsin@ — rosina)? = #. 
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Expanding the squares gives 
1? cos? @ — 2rry cos O cosa + rå cos? a + 1 sin? O — 2rro sin O sin a + DEn aE 


Seeing squares of cosine and sine makes us look for ways to add them to get 1. Fortunately, factoring does the 
trick both for both 0 and a. We can rewrite the equation as 


r (cos? @ + sin? 0) — 2rro(cos 0 cos a + sin @ sina) + r3(cos* a + sin yt 
Applying trigonometric identities to each of the three expressions in parentheses, the equation becomes 


r — 2rrcos(0 - a) +n =P. 


Problem 5.14: 
(a) What are the oe coordinates of the point that results when CE V2,5 V6 Au is ned 2u counter- 
clockwise about the origin? 


(b) Show that if the point (x, y) is rotated O counterclockwise about the origin, the result has rectangular 
coordinates (x cos 0 - y sin 0, y cos @ + xsin 0). 


Solution for Problem 5.14: 


a) Aside from the name of this section, we have a number of clues to try polar coordinates. First, we don’t have 
yP 

good tools for handling rotations in rectangular coordinates (aside from rotating by multiples of 5). Second, 

rotation has to do with angles, and we have angles in polar coordinates, not in rectangular coordinates. So, 


we start by converting (5 V2,5 V6) to polar coordinates. We have r? = x? + y? = 50 + 150 = 200, so we can 
let r = 10 V2. We also have tan O = * = 3. Since the point is in the first quadrant and we are using the 
positive value of r, we have 0 = 3. Therefore, the point (5 2,5 V6) in rectangular coordinates is the same 
as the point (10 2) z) in polar coordinates. 


Finding the result of rotating (10 ve z) counterclockwise 2 about the origin is easy. We simply add 
2 to the angle, since rotation about the origin changes the angle by the amount of the rotation, but it 
oe t change the distance from the point to the origin. So, rotating (10 v2 v2, z) counterclockwise gives 
us (10 ¥2,7). Converting this back to rectangular coordinates gives (—10 v2, 0). 


(b) We use our first part as a guide. 


‘Concept: | When proving a formula, it’s often useful to work out a specific example, and 


usin use the aS as a guide to find the eee 


Let s a) z e polar earainates of the pois with rechaaeulan coordinates (a y). Rotating (r, mm counter- 
clockwise about the origin by 0 gives the point with polar coordinates (r,a+ @). The rectangular coordinates 
of this point are (r cos(a@ + @),rsin(a + @)). Applying angle sum identities, we can write these coordinates as 


(rcosacos 0 —rsinasin 6,rsinacos O + r sin @ cosa). 


That’s not quite what we were looking for—the result we wish to prove has x and y, and no r or a. Since 
(x, y) in rectangular coordinates is (r, œ) in polar coordinates, we have x = rcosa and y = rsina. We can 
substitute x for r cos a and y for sina in the long coordinates above to give the simpler-looking 


(xcos 0 — ysin 0, ycos 0 + xsin 0). 
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Important: The image of the point (x, y) upon a rotation @ counterclockwise about the 


VY origin is (xcos 0 — y sin 0, y cos 8 + xsin 8). 


Problem 5.15: Leth and k be constants, and let a and b be positive constants with a > b. The graph of an 
equation of the form 


@-h? | (y-h? 
JŽ ye 
is an ellipse whose major axis has length 2a. Every ellipse in the Cartesian plane with a horizontal major axis 
is the graph of an equation of this form. 
The graph of the equation 7x? — 6 V3xy + 13y? 


=1 


= 64 is also an ellipse. Find the length of its major axis. 


Solution for Problem 5.15: Let &be the graph of 7x*—6 V3xy+13y? = 64. If we can manipulate 7x2—6 V3xy+13y" = 64 


ms Lays 
into the form a ore ee = 1, then we can easily find the length of the major axis. Unfortunately, the xy term is a 


problem. There is no xy term in the form we're given for an ellipse with a horizontal major axis, so & must be an 
ellipse whose major axis is not horizontal. If our ellipse had a horizontal major axis, we could solve the problem 
quickly. So, we try to relate & to an ellipse with a horizontal major axis that is the same length as the major axis of 


8. 


Rotation does not affect length, so we can view & as the result of rotating an ellipse S’ with a horizontal major 
axis. Suppose the required rotation is a counterclockwise rotation by 9, so that for every point (x’, y’) on &’, there 
is a point (x, y) on & that is a counterclockwise rotation by 0 of (x’, y’). From the result of Problem 5.14, we have 


x=x'cosd-y' sin, 
y=x'sin@ + ycos@. 


Substituting these into 7x* — 6 V3xy + 13y = 64 produces 
7(x’ cos 0 — y’ sin 0}? — 6 V3(x' cos 0 — y’ sin @)(x’ sin O + y’ cos 0) + 13(x’ sin @ + y’ cos 0}? = 64. (5.1) 


Yikes. Multiplying that out looks scary. Fortunately, we don’t have to do so just yet. Our goal is to eliminate the 
x’y’ term. So, we just look at the x’y’ terms in the expansion of the left side, which are 


~14x’y’ cos 8 sin 6 — 6 V3x'y' cos? 6 + 6 V3x'y' sin? 6 + 26x’y’ cos O sin 8. 
We want these terms to cancel, so we must have 
—14cos 8 sin 0 — 6 V3 cos? 0 + 6 V3 sin? 6 + 26 cos Ø sin 0 = 0. 


Simplifying a bit es us 12 cos 0 om 0-6 V3 3(cos? 0 — sin? 0) = 0. Now we see a chance to use trig identities. We 
have cos Osin @ = § sin 28 and cos’ 6 — sin? 0 = cos 20, so we can write the equation as 


6sin 20 — 6 V3cos 26 = 0, 


from which we have (sin 20)/(cos 20) = v3, or tan20 = V3. Since tan 60° = v3, we find that 20 = 60°, so @ = 30°. 
This means that & is a 30° counterclockwise rotation of & about the origin. 


We can now find the equation whose graph is &’. Substituting 0 = 30° into Equation (5.1) gives 


r(x ey I 616 (x e z) (eger E) +(x dey 9) = 64. 
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Multiplying both sides by 4 gets rid of the fractions and gives 
7(V3x’ — y — 6 V3(V3x" — y’)(x’ + VBy’) + 13(x’ + V3y’)* = 256. 
Expanding and then simplifying the left side gives 
16(x’)? + 64(y’)? = 256. 


The x’y’ terms all canceled out, as expected. Dividing both sides by 256 puts the equation in the form of an ellipse 
described in the problem, 


ToT wae 
Comparing this to 
2 y 
2 + R = ll, 


we find a = 4, so the length of the major axis is 2a = 8. O 


L _txercises ig 
5.2.1 Convert each of the following rectangular coordinates to polar coordinates. (You may use a calculator if 
necessary.) 

(a) (=5, 0) 

(b) (6V2,6) 

(c) (-4, —4 v3) 


5.2.2 Convert each of the following polar coordinates to rectangular coordinates. 


(a) (7,210°) 
(b) (9,—180°) 
(c) (-8, on) 
4 
2cos 0 — 5sin6` 


5.2.4 The final form we found in Problem 5.13 for an equation whose graph is a circle does not match the form 
of the equation we graphed in Problem 5.12. But the graph we found in Problem 5.12 is indeed a circle. This 
suggests we can write the equation we graphed in Problem 5.12, which is r = 6sin 90, in the form we found in 
Problem 5.13, which is 7? — 2rro cos(0 — a) + r3 = t. Show that we can do so. 


5.2.3 Graph the equation r = 


5.2.5 Let A and B be points with polar coordinates (6, 137°) and (12, 0), respectively, and let O be the origin. 


(a) Find all possible values of 0 with 0° < 0 < 360° such that AO BO. 
(b) Find all possible values of O with 0° < @ < 360° such that AB L AO. 


5.2.6 What does the graph of r = 2 + 30 look like, where 0 2 0 and we express 0 in radians when computing r? 


5.2.7 A regular hexagon with center at the origin has one vertex at (3,3). Find all vertices of the hexagon that are 
in the second quadrant. 
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5.3 Coordinates in Three Dimensions 


Just as we use the horizontal and vertical coordinates (x, y) to identify the location 
of a point in the two-dimensional Cartesian plane, we can identify the location of a 
point in three-dimensional Cartesian space with three coordinates (x, y,z). We call 
the plane in which z = 0 the xy-plane. Points on this plane are identified by the 
coordinates (x, y,0), where the x and y have the horizontal and vertical meanings 
they have in the Cartesian plane. For other points (x, y, z) in space, the z-coordinate 
indicates how far above (z > 0) or below (z < 0) the xy-plane the point is. The x- and 
y-coordinates of (x, y,z) are the same as those of the point in the z = 0 plane that is 
directly above or below the point (x, y,z). 


As shown in the diagram at right, we have a third axis in three dimensions, which we call the z-axis. In this 
diagram we depict the point (4,3,5). We include edges of a right rectangular prism with opposite vertices at 
(0,0, 0) and (4, 3,5) to help show the location of (4, 3,5) in space. 


Just as all points (x, y,z) with z = 0 form the xy-plane, the yz-plane consists of all points of the form (0, y, z), 
and the xz-plane consists of all points of the form form (x, 0, z). 


In most of the three-dimensional diagrams we include in this book, we’ll omit the negative portions of each 
axis, and remove the tick marks and numbers along the axes, since these diagrams can get very cluttered if all of 
this information is included. Sometimes, we’ll even exclude the axes, and just include the key geometric figures. 


| Probiems i 


Problem 5.16: We start our exploration of coordinates in three dimensions by finding a formula for the distance 
between two points in space. 
(a) Let A be (—3,4,6) and B be (4, -1,2). Find AB. 


(b) Find a formula for the distance between the points (x1, y1, z1) and (x2, y2, Z2). 


Problem 5.16: 
| (a) Let A be (—3, 4,6) and B be (4, -1,2). Find AB. 


(b) Find a formula for the distance between the points (x1, y1, 21) and (x2, y2, Z2). 


Solution for Problem 5.16: 


(a) We find AB by considering the rectangular prism with A 
and B as opposite vertices and with each edge parallel to a 
coordinate axis, as shown at right. We can then find AB by 
applying the Pythagorean Theorem twice. As shown in the 
diagram, the points (4,4, 2) and (—3, 4, 2) are vertices of the 
prism. Let these be C and D, respectively. Right triangle 
BCD gives us B(4, -1, 2) 


BD = VBC? + CD? = V25 + 49 = V74. 
Then, right triangle ADB gives us 


AB = VBD? + AD? = V74 + 16 = 3 V10. 


A(-3, 4,6) 


D(-3, 4, 2) 


C(4, 4, 2) 
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(b) We follow the same strategy as in part (a). We construct the 
rectangular prism with (x1, y1, Z1) and (x2, y2, Z2) as opposite 
vertices and with edges parallel to the coordinate axes, as 
shown. From right triangle BCD, we have BD? = CD? +BC?, 
and from right triangle ABD, we have 


A(x2, Y2, z2) 
Iz2 — zıl 


D(x2, 2,21) 


B(x, Yi, Z1) 
ly2 - yıl 

so AB = VCD? + BC? + AD?. Writing CD, BC, and AD in C(x1, Y2, z1) 

terms of the coordinates as shown, we see that the distance 

between (x1, Yi, Z1) and (Xo, Y2, Z2) is 


AB? = BD? + AD? = CD? + BC? + AD’, AT 


(x2 = x1} + (y2 — y1} + (z2 - z1}. 


-The distance between the points (x, y1, z1) and (x2, y2,22)i8 o 


vV 2 e R 


Important: 


v 


Problem 5.17: Describe the graph of the equation x* + y* + z* — 6x — 4y + 8z =7. 


Solution for Problem 5.17: The equation strongly resembles a two-variable equation whose graph is a circle, so we 
suspect the graph is a sphere. When confronted with similar equations with two variables, we typically complete 
the squares in the quadratics, so we try that here. Completing the square in x, in y, and in z gives us 


(x — 3)? + (y— 2)? +(z +4 =7+9+4+16=36. 


The left side looks a lot like the distance formula in three dimensions. In fact, the expression on the left equals the 
square of the distance between (x, y, z) and (3, 2, —4). Since the right side is 67, the equation tells us that the distance 
between (x, y, z) and (3,2, —4) is 6. Sure enough, the graph of all such points is a sphere with center (3, 2, —4) and 
radius 6. 0 


Just as Cartesian coordinates are not the only way to describe the location of a point in a plane, the rectangular 
coordinates (x, y, z) described above are not the only way to describe the location of a point in space. We'll now 
introduce two other ways to describe the location of a point in space. 


The first is cylindrical coordinates. In cylindrical coordinates, the x- and y- 
coordinates are replaced by r- and 6-coordinates such that the point in the Cartesian 
plane with rectangular coordinates (x, y) has polar coordinates (r, 0). The third cylin- 
drical coordinate is the same z-coordinate as in rectangular coordinates. A depiction 
of the point (r, 0, z) in cylindrical coordinates is shown at right. When expressing a 
point in polar coordinates, the radius always comes first, the angle second, and the 
z-coordinate last. 


Relating the cylindrical r- and 0-cylindrical coordinates to rectangular coordinates 
in three dimensions is the same as relating polar coordinates (r, 0) to rectangular 
coordinates (x, y) in two dimension. If the cylindrical coordinates (r, 0, z) represent 
the same point as the rectangular coordinates (x, y, z), then x = r cos 0 and y = rsin 0. 
As with polar coordinates, we usually choose r to be nonzero and @ such that 0 < @ < 2m. However, r can be 
negative and @ can be outside the interval [0, 27), just as with polar coordinates. 
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Cylindrical coordinates aren’t a natural analogue to polar coordinates. When a point is expressed in polar 
coordinates, the absolute value of the r-coordinate is the distance from the point to the origin, but in cylindrical 
coordinates, the r-coordinate gives the distance from the point to the z-axis, not the origin. A more natural 
analogue in three dimensions to the polar coordinates of two dimensions starts with letting the first coordinate 
represent the distance from the origin to the point. We do just that when we describe a point’s location with 
spherical coordinates. 


The first spherical coordinate is p, pronounced “roh,” which is the 
distance from the point to the origin. But specifying the distance from the 
origin to a point is not enough to determine exactly where the point is. 
In order to describe the location of the point, we need a pair of angles as 
well. The first of the two angles we use to describe a point P in spherical 
coordinates is the same as the angle in cylindrical coordinates. We think 
of the half-plane that starts from the z-axis and extends through P. The 
angle 0 is the angle between this plane and the positive x-axis, as shown, 
like the angle 6 in cylindrical coordinates. We define the second angle 
p, pronounced “fee,” as the angle between OP and the positive z-axis, 
as shown. Therefore, every point in space can be described with a triple 
(p, 9,0), with p>0,0< 0<2nand0<¢<x. 


While we usually restrict p, 0, and @ such that p >0,0<@<2nxand x 
0 < @ < m, we don't have to. Allowing p to be negative has the same effect as allowing r to be negative in polar 
coordinates. When r is negative in polar coordinates, the point is in the direction from the origin that is opposite 
the direction described by the 6-coordinate. When p is negative in polar coordinates, the point is in the direction 
from the origin that is opposite the direction described by the 0- and ¢-coordinates. So, for example, the point 


(1,0, 1) can be described as both (v2, 0, z) and (- V2, 7, sn) in spherical coordinates. 


When reading about spherical coordinates in another source, make sure you 
know what each variable stands for and what the restrictions on each are. 
While most sources will define the two angles in spherical coordinates in 
essentially the same way we have here, some will use different variables 
than we have. Even more confusing, some will use the same variables, but 
reverse their meanings. Furthermore, some will restrict p to be nonnegative, 
and some will use r rather than p. 


|| Problems BEG 


Problem 5.18: 
(a) Describe the graph of all points with cylindrical coordinates (r, 0, z) such that r = 7. Express the equation 


in rectangular coordinates. 


WARNING! 
ry" 


(b) Describe the graph of 0 = 7 in cylindrical coordinates. Express the equation in rectangular coordinates. 


Problem 5.19: Suppose (p, 8, Q) are the spherical coordinates of point P, where p > 0, and that (x, y,z) are the 
rectangular coordinates of point P. 

(a) Find p in terms of x, y, and z. 

(b) Let O be the origin and A be the foot of the altitude from P to the z-axis. Find AO in terms of p, Ø, 
and/or @. 
Find x, y, and z in terms of p, 0, and @. 


(c) 
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Problem 5.20: 
(a) Convert the cylindrical coordinates (4, ae 4) to rectangular and spherical coordinates. 


)} Convert the s herical coordinates (7, 2, =) to rectangular and cylindrical coordinates. 
R ae 8 y 


Problem 5.21: Convert each of the following rectangular coordinates to cylindrical coordinates and to spherical 
coordinates. 


(a) (6,-6,-2 V6) (b) (-6,0,6 V3) 


Problem 5.22: 
(a) Describe the graph of the equation E =x oh”, 


(b) Find parametric equations for x, y, and z for the equation ue = x* + 42. (You will need two parameters.) 


Problem 5.23: 
(a) Describe the graph of all points with spherical coordinates (p, 8,¢) such that p = 5. Express the equation 
in rectangular aaa and give a parameterization of the equation for x, y, and z. 


(b) Describe the graph of @ = 4 in a spherical coordinates. Express the equation in rectangular coordinates. 


For the final problem in this section, we need to understand how we determine the latitude and longitude 
that we typically use to locate points on the Earth’s surface. To do so, we start with a great circle of a sphere, 
which is a cross-section of the sphere that has the same center as the sphere. We choose the great circle that is in 
the plane perpendicular to the axis of the Earth to be the equator of the earth. The equator has latitude 0°, and it 
divides the Earth into the Northern hemisphere and the Southern hemisphere. 


The North and South Poles of the Earth are the points at which the axis Noain ede 
intersects the surface of the Earth. To define longitude, we first choose one 
semicircle along the surface of the Earth, with the poles as endpoints, to be the 
prime meridian. We say that the longitude of any point on the prime meridian 
is 0°. For any point P not on the prime meridian, we determine its longitude by 
considering the plane through P parallel to the equator. Let this plane intersect 
the axis at A and the prime meridian at M. Then, ZPAM is the longitude of P. 
We have one little wrinkle here—we add a direction to the longitude to tell us 
which way to go, east or west, to get to P fastest from M. So, if ZPAM is 45° 
and we go west from M to get to P, then the latitude of P is 45°W, where the 
W stands for “west.” 


South Pole 


Now that we have longitude, we can use it to define latitude. First, we say 
the North and South Poles have latitudes 90°N and 90°S, respectively. For any 
other point T not on the equator, we let E be the point on the equator with the 
same longitude as T and let O be the center of the Earth. Then, ZTOE gives 
us the latitude, where we assign the latitude the designation “N” if the point 
is in the Northern hemisphere and the designation “S” if it is in the Southern 
hemisphere. So, if ZTOE = 30° and T is in the Northern hemisphere, we say 
that its latitude is 30°N. 


Note that the equator of the Earth is determined by the rotation of the Earth, 
whereas the prime meridian is an arbitrary choice. It was originally established 
by Sir George Airy as the Greenwich Meridian in 1851, and was established as 
the international standard by the International Meridian Conference in 1884. 
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It is now marked by a laser beam pointing towards the North Pole from the Royal Observatory in Greenwich, 
London. 


Problem 5.24: 
(a) What is the relationship between spherical coordinates and the latitude and longitude that we use to 
identify the location of points on the surface of the Earth? 


(b) Why do we use a system much like spherical coordinates for points on the Earth’s surface rather than 
using rectangular coordinates? 


(c) Petr has lived his whole life in Petropavlovsk-Kamchatsky on the frigid northeastern shores of Siberia. 
He is sick of the cold, and has spent the last 5 years building a sailboat he hopes to sail to Honolulu, 
Hawaii, where he will be much warmer. If he sails at 10 km per hour, how many days will it take him 
to sail from Petropavlovsk-Kamchatsky, which has coordinates 52.92° N, 158.49° E, to Honolulu, which 
has coordinates 21.31° N, 157.83° W? You may assume the Earth is a sphere with radius 6370 kilometers, 

and that Petr travels along a single great circle of the sphere throughout his trip. 


“Problem 5.18: 
(a) Describe the graph of all points with cylindrical coordinates (r, 0,z) such that r = 7. Express the equation 
in rectangular coordinates. 


(b) Describe the graph of the points in space such that 0 = 3 in cylindrical coordinates. Express the equation f 
in rectangular coordinates. 


Solution for Problem 5.18: 


(a) Intwo dimensions, the graph ofr = 7 is a circle centered at the origin with radius 7. 
In three dimensions, we can let z be any real number when we produce the graph 
of r = 7. Therefore, every cross-section of the graph perpendicular to the z-axis is 
a circle with radius 7 centered at a point on the z-axis. Since all these circles have 
the same radius, the resulting graph of r = 7 in three dimensions is a cylinder that 
extends forever in both directions of its axis. The radius of the cylinder is 7, and 
the z-axis is the axis of the cylinder. 


Just as in polar coordinates, we have 


+ y* = (rcos OF + (rsin@) = r (cos? 0 + sin? 0) = 7°. 


So, to express the equation r = 7 in rectangular coordinates, we square the equation to get r? = 49, which 
gives x? + y? = 49. 

(b) In polar coordinates, the graph of 0 = 7 is a line through the origin that makes 
an angle of % with the x-axis such that the line passes through the first and 
third quadrants. Since we can use any value for the z-coordinate in the graph 
of 9 = 3 in three dimensions, the graph of consists of the aforementioned line 
in the plane z = 0, as well as every point “above” or “below” (i.e., with any 
z-coordinate) the line in space. In other words, the graph is a plane, as shown 
at right. 

To convert the equation to rectangular coordinates, we note that x = r cos 0 = 
reos§ = 5 and y =rsin@ = ae, For all r # 0, we can divide these equations 
toget 3 Rearranging this equation gives y = 3x as the equation in 
rectangular form. (Note that points of the form (0,0,z) are on the graph and 
satisfy this equation.) O 
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Problem 5.19: Suppose (p, 0, p) are the spherical coordinates of point P, where p 2 0, and that (x, Y, z) are the 
rectangular coordinates of point P. 

(a) Find p in terms of x, y, and z. 

. (b) Let O be the origin and A be the foot of the altitude from P to the z-axis. Find AO in terms of p, 6, 
and/or @. 


Find x, y, and z in terms of p, 0, and @. 


(c) 


Solution for Problem 5.19: 


(a) The value of p is the distance from the origin to the point (p, @,¢) in spherical coordinates. The distance 
from (x, y, z) to the origin is \/x* + y* + z2. Therefore, we must have p = \/x? + y? + 22. 

(b) In the diagram at right, we have drawn PA perpendicular to the 
z-axis and PB perpendicular to the xy-plane, thereby creating rect- 
angle PBOA. In right triangle POA, we have cos p = 48 = ae sO 
AO = pcos @. 

(c) Since the z-coordinate of A matches that of P, we have z = AO = 
pcos@. To find the x- and y-coordinates of P, we focus on 
point B in the xy-plane. Since PBOA is a rectangle, we have 
OB = PA. From right triangle POA, we have $$ = sing, so 
PA = POsing = psing. Therefore, we have OB = PA = psin®g. 
The x- and y-coordinates of P match those of B. The cylindrical 
coordinates of B are (OB, 0,0), so the rectangular coordinates are 
(OB cos 0, OB sin 8,0). Therefore, the x- and y-coordinates of B and 
P are x = OB cos 0 = psingcos 0 and y = OB sin 0 = psin ọsin 0. 


In summary, we have: 


; Important: The point with spherical coordinates (p 0,0) has rectangular coordinates 


x = psingcos 0, 
y =psingsing, 
z = pcos. | 

Perhaps you’ve noticed that our proof only works if 0 < @ < 3. You'll complete the proof for other 
values of ¢ as an Exercise. 


Problem 5.20: 
| (a) Convert the cylindrical coordinates (4, au 4) to rectangular and spherical coordinates. 


7m n 


a z) to rectangular and cylindrical coordinates. 


| (b) Convert the spherical coordinates (7, 


Solution for Problem 5.20: 


(a) First, we convert to rectangular coordinates. We have x = rcos0 = 4cos § = 2 and y = rsin@ = 4sin Ẹ = 
2 V3. The z-coordinate is the same in rectangular and polar coordinates, so the rectangular coordinates of 
the point with cylindrical coordinates (4, 3,4) are (2,2 ¥3,4). 


Turning to spherical coordinates, the 0-coordinate is the same in both cylindrical and spherical coordi- 
nates. We can use the rectangular coordinates to find ; 


p= x+y? +22 = Va412 416 = 42. 
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Finally, since z = p cos @, we have cos p= : = Z = ae So, we can let @ = F, and (4 V2, ay z) are spherical 
coordinates of the point. 

(b) Asin part (a), we start by finding the rectangular coordinates: 

A ( 2 R 


x = psinġcos0 =7 ($ 


y = psinġsinĝ =7 ($) (-¥) $ n 


z= peosp=7(5] = A 


To convert these to cylindrical coordinates, we note that we already have the z-coordinate from the rectan- 
gular coordinates and the -coordinate from the spherical coordinates. Finally, we have 


so (2, To 7) are cylindrical coordinates of the point. 


o 


| Problem 5.21: Convert each of the following rectangular coordinates to cylindrical coordinates and to spherical 
| coordinates. 


| @ (6,-6,-2 V8 (b) (-4,0,4V3) 


Solution for Problem 5.21: 


(a) In two dimensions, the point with rectangular coordinates (6, —6) has polar coordinates (6 v2, m), so the 
point in space with rectangular coordinates (6, —6, —2 V6) has cylindrical coordinates (6 v2, %,-2 v6) 


Turning to spherical coordinates, we have p = \/x? + y2 +2? = V36 + 36 + 24 = 4 V6. The @-coordinate 
is the same in spherical coordinates as in cylindrical coordinates. To find @, we use the fact that z = pcos @, 


so we have —2 V6 = (4 V6) cos@. Therefore, we have cos@ = -4, so we take @ = 2B This gives us the 
spherical coordinates (4 V6, =, 2m) 

(b) In two dimensions, the point with rectangular coordinates (—4,0) has polar coordinates (4,72), so the the 
point in three dimensions with rectangular coordinates (—4, 0, 4 V3) has cylindrical coordinates (4, 71,4 v3). 


In spherical coordinates, we have p = ,/x? + y? + z? = V16 + 48 = 8, and the 0-coordinate is the same in 
spherical coordinates as in cylindrical coordinates. From z = p cos ¢, we have 4 V3 = 8cos@, so cos@ = 2 
and ọ = %. Therefore, (8, Tl, z) are spherical coordinates of the point. 


Problem 5.22: 


(a) Describe the graph of the equation z? = x? + y’. 
| (b) Find parametric equations for x, y, and z for the equation z? = x? + y*. (You will need two parameters.) 
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Solution for Problem 5.22: 


(a) We can begin by experimenting with different values of z. When z = 0, we have 
x? + y? = 0, so when z = 0, the graph is just a point. Letting z = 1 or z = —1 
gives x? + 42 = 1, which describes a circle with radius 1. Similarly, for larger 
and larger values of z, the resulting equations have graphs that are circles with 
larger and larger radii. 


We might therefore guess that the graph is a pair of cones that open forever 
in both directions. Expressing the surface in cylindrical coordinates makes it 
very obvious that this is the case. We have z? = x*+y? = 7°, soz = +r. Therefore, 
the radius of each circular cross-section of the graph equals the distance from 
the cross-section to the origin. The graph is the pair of cones shown at right. 
Each cone has the z-axis as its axis. 


(b) Consider the upper cone, for which z > 0. In part (a), we saw that every cross-section of the surface with 
constant z-coordinate is a circle with radius r centered at a point on the z-axis, where r = z. So, the x- and 
y-coordinates can be parameterized as x = r cos 0, y = rsin @. This holds for any positive value of z, so we 
can parameterize the upper cone with x = rcos 0, y =rsin@,z =r. 


m porny terization holds for the lower cone, too! If we have x = rcos 90, y = rsin 0, and z = r then 
x+y? =r? = 2? still, so (x, y, z) = (r cos 0, r sin 0, r) is on the circle centered at (0, 0, r) in the plane z=r. As 
0 ranges from 0 to 27, we trace out the whole circle, so the parameterization x = rcos0, y =rsin@,z =r 
generates all the points on the lower cone for r < 0. Finally, taking r = 0 shows that the parameterization 
includes the origin, as well. 


O 


One significant difference in our answer to part (b) above and the parameterizations we performed in Section 5.1 
is that our parameterization in part (b) above required 2 parameters, whereas each parameterization in Section 5.1 
required only 1 parameter. The reason for this difference is that our parameterizations in Section 5.1 were used 
to describe 1-dimensional curves in a 2-dimensional plane, whereas the parameterization of the cone describes a 
2-dimensional surface in a 3-dimensional space. Since we are describing a 2-dimensional surface, we need two 
parameters. 


Problem 5.23: 
(a) Describe the graph of all points with A iE a 0, ġ) such that p = 5. ne the equation 
in rectangular coordinates, and give a pann of the equation for x, y, and z. 


(b) Describe the graph of the points in space such that 6 = $ in spherical coordinates. Express the equation 
in rectangular coordinates. 


Solution for Problem 5.23: 


(a) The graph of p = 5 consists of all points that are exactly 5 units from the origin. Therefore, the graph is a 
sphere centered at the origin with radius 5. To express the equation in rectangular coordinates, we write p 
as \/x? + y? + z*. Since this quantity is always nonnegative, we can square the equation ,/x? + y2 +z? =5 
to produce the more attractive equivalent equation x? + y* + z? = 25. 

We can parameterize this equation using the conversion from spherical to rectangular coordinates. 
Since p = 5, we have the parameterization x = pcos Osing = 5cos sing, y = psinOsing = 5sin O sino, 
z = pcos = 5cos®@. 
(b) Let Obe the origin and P be a point that has ¢ = = in spherical coordinates. The ¢-coordinate te gives us Ee 


angle that OP makes with the positive portion of a z-axis. Therefore, any point on the ray OP has o= = 
when expressed in spherical coordinates. Moreover, we can rotate any point on this ray about the z-axis 
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by any angle to produce another point with the same ¢-coordinate. In other words, we can form a portion 
of the graph by rotating the whole ray around the z-axis, thereby sweeping out a cone. Since we can allow 
p to be negative, the graph consists of two cones, one below the xy-plane (for p > 0), and one above the 
xy-plane (for p < 0). 

We can see for certain that the graph is a cone by considering a particular cross-section perpendicular to 
the z-axis. For such a cross-section, z is fixed. Because ¢ is also fixed by the equation @ = 2, the relationship 
z = pcos @ tells us that p must also be constant for all points on the graph in this cross-section. Moreover, 
since ¢ is fixed for all points in the graph, the relationship z = p cos @ tells us that p is directly proportional 
to z. Finally, for any point in the cross-section, we have 


+ y = (p sin ọ cos 0)? + (p sin ọsin 0)? = (0 sin? )(cos? 0 + sin’ 0) = p? sin? Q. 


So, the cross-section is a circle with radius p sin ġ, which is directly proportional to p, and therefore directly 
proportional to z. Since every cross-section is a circle with radius directly proportional to the magnitude of 
z, the graph of ¢ = 2u is a pair of cones with vertex at the origin and with the z-axis as their common axis. 
We can express the equation in rectangular coordinates by starting from the equation x? + y? = p° sin? @ 
. 2 
above. Since ọ = 2%, we have x? + y? = ¥. We also have p? = xX + p +2, s0 X +y = ICR +27) 


which simplifies as x? + y? = 3z2. (We also could have noted that z = p cos ġ, so psing = z2 = ztan@. 
For @ = , we then have x? + y? = p’ sin? ọ = z? tan? ọ = 322.) 


coso 


Problem 5.24: 
(a) What is the relationship between spherical coordinates and the latitude and longitude that we use to 
identify the location of points on the surface of the Earth? (A definition of latitude and longitude can be J 
found on page 180.) 


(b) Why do we use a system much like spherical coordinates for points on the Earth’s surface rather than 
using rectangular coordinates? 


(c) Petr has lived his whole life in Petropavlovsk-Kamchatsky on the frigid northeastern shores of Siberia. $ 
He is sick of the cold, and has spent the last 5 years building a sailboat he hopes to sail to Honolulu, 
Hawaii, where he will be much warmer. If he sails at 10 km per hour, how many days will it take him § 
to sail from Petropavlovsk-Kamchatsky, which has coordinates 52.92° N, 158.49° E, to Honolulu, which § 
has coordinates 21.31° N, 157.83° W? You may assume the Earth is a sphere with radius 6370 kilometers, § 

and that Petr travels along a single great circle of the sphere throughout his trip. | 


Solution for Problem 5.24: 


(a) Let the center of the Earth be the origin, and the Earth’s axis be the Z 

z-axis, with the North Pole on the positive z-axis and the South 

Pole on the negative z-axis. Then, longitude is closely related North Pole 
to 0 and latitude is related to ġ. If we take the prime meridian 
to be entirely within in the xz-plane, so that the prime meridian 
intersects the positive x-axis as shown, then we can view “east” 
as counterclockwise, so the longitude of a point in the Eastern 
hemisphere equals the angle 0 in spherical coordinates. The points 
in the Western hemisphere correspond to those with values of 0 
for which 180° < 0 < 360°. We can take care of these by noting 
that subtracting 360° from 0 doesn’t change the location of a point 
with this 6-coordinate, and —180° < @ — 360° < 0°. Therefore, the 
-coordinate of any point in the Western hemisphere equals the oem Pole 
negative of its longitude. 
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0 


(b) 


(c) 


Turning to latitude, suppose P4 is a point on the Earth’s surface in the Southern hemisphere and O is the 
origin. The angle between OP, and the positive z-axis (@ in spherical coordinates) is 90° plus the latitude of 
Pı. Meanwhile, suppose point P} is on the Earth’s surface in the Northern hemisphere. The angle between 
OP, and the positive z-axis is 90° minus the latitude of P2. 


If we center the Earth at the origin (and assume the Earth is a sphere), then in spherical coordinates, all the 
points on the Earth’s surface have the same p coordinate. So, in order to tell anyone the location of a point 
on the Earth’s surface, we need only tell them two coordinates, 0 and @. In rectangular coordinates, all 
three coordinates must be specified to determine a point on the Earth’s surface. 


Let the center of the Earth be O, let Petropavlovsk-Kamchatsky be P, and let Honolulu be H. To find the 
amount of time Petr spends sailing, we must find the distance along the surface of Earth from P to H. This 
distance is the arc of a great circle. We know the radius of this circle; it equals the radius of the Earth. We 
only have to find the angle swept out by this arc, which is ZPOH, and then we can find the length of the 
arc. Unfortunately, it’s not so obvious how to find that angle from the latitude and longitude of P and of H. 


However, we do know that PO and OH equal the radius of the Earth, which we'll let be r. If we can 
determine any other angle or the third side length of APOH, then we can find ZPOH. We are essentially 
given the spherical coordinates of P and of H; we can use these to find the distance from P to H using the 
three-dimensional distance formula. 


We let (pp, Op, Qp) and (pp, Ox, Qu) be the spherical coordinates of P and H. The first coordinate is easy; it 
equals the radius of the Earth, so we have pp = py = r. Turning to the 0-coordinate, we apply the reasoning 
of part (a) to the longitudes of P and H. Since P is in the Eastern hemisphere, its 0-coordinate equals its 
longitude, so Op = 158.49° for P. Point H is in the Western hemisphere, so we can take its @ to be the 
negative of its longitude, which gives 0y = —157.83°. Finally, both P and H are in the Northern hemisphere, 
so their @-coordinates are in 90° minus their latitudes. This gives us @p = 37.08° and @y = 68.69°. Now, we 
can find PH by converting these spherical coordinates to rectangular coordinates and using the distance 
formula. We can do both of these in one step using the relationships x = pcos 0 sing, y = p sin @sing, and 
Z=pcos@. Since pp = po = r, we have 


PH = y (rcos Oy sin ġy — rcos Op sin pp)” + (rsin Oy sin hy — rsin Op sin bp)’ + (rcos@y — rcos@p)”. 


Substituting all our values from above, we find that PH ~ 4970 km. Since we have PO, HO, and PH, we 
can use the Law of Cosines to find cos ZPOH: 
PO? + HO? — PH? 
ZPOH = — > x 0.696. 

cos ZPO PONOR) 0.696 
Therefore, ZPOH ~ 45.9°, which means that Petr will sail $2 ~ 0.128 of a great circle during his trip. Since 
the circumference of the Earth is 27(6370) ~ 40000 km, Petr will sail 0.128 (40000) ~ 5120 km. He sails at 10 
kilometers per hour, so he'll sail for 512 hours, which is 21 days and 8 hours. 


That’s a long trip. You might wonder if there’s a formula to find the distance between two points on the 
Earth given their latitudes and longitudes. There is, and you'll prove it on page 480. 


|| Exercises Ba 


5.3.1 Express each of the following rectangular coordinates in cylindrical and spherical coordinates. 


(a) (=2 v2, 2 V2, 4) (b) (3, 0, =o v3) 


5.3.2 Suppose the point with spherical coordinates (p, 0, ġ), where p 2 0, has cylindrical coordinates GG, z). 
Express p in terms ofr, 0, and z. 
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5.3.3 In Problem 5.19 we only proved the relationship between spherical coordinates and rectangular coordinates 
for 0 < @ < 5. Complete our proof by addressing points for which 5<o<snorp=0. 


5.3.4 Show that if p 2 0, then the distance from (p sin @ cos 0, p sing sin 0, p cos @) to the origin is p. 
5.3.5 Let the cylindrical coordinates of P be (r, 0,z). What are cylindrical coordinates of the following points: 


(a) The reflection of P through the xy-plane. 

(b) The rotation of P by n around the z-axis. 

(c)x The reflection of P through the origin. (The reflection of a point X through a point Y is the point Z such 
that Y is the midpoint of XZ.) 


5.3.6 We showed that the graph of ọ = # in spherical coordinates is a pair of cones. Are there any values of c 
for which the graph of ọ = c is not a pair of cones? 


5.3.7 Suppose we used cylindrical coordinates to identify the location of points on the surface of the Earth, with 
the center of the Earth as the origin. 


(a) Would it be possible to travel along the surface of the Earth and keep the r-coordinate constant? The 
0-coordinate? The z-coordinate? 


(b) Is it possible to travel along the surface of the Earth and keep two of the cylindrical coordinates constant? 


5.3.8 


(a) Describe the graph of all points such that p = 4 and @ = ¥ in spherical coordinates. 
(b)x Describe the graph of all points such that 0 = } and p = 6sin¢@ in spherical coordinates. 


5.3.9 Consider the equation (x + 3)? + (y + 5} + (z-1)? = 4. 


(a) Find a parameterization for x, y, and z for all points that satisfy the equation. 


(b)*x Find a parameterization for the portion of the graph of the equation that is on or above the xy-plane. 


5.4 Summary 


Rather than having a single equation that relates x and y, we sometimes define a relationship between x and y 
through an intermediary variable called a parameter. We do so by expressing x and y as functions of a third 
variable, such as t. We call the system x = f(t), y = g(t) a pair of parametric equations, where t is the parameter. 
When we graph parametric equations x = f(t), y = g(t), we graph all points (x, y) such that there is a value of t for 
which x = f(t) and y = g(t). 


In addition to the more common rectangular coordinates (x, y), polar coordinates 
offer another method for denoting the location of a point on a plane. As with 
rectangular coordinates, we start with a point we call the origin and we identify the 
location of each point with an ordered pair, (r, 0). The r-coordinate is the distance 
from the point to the origin; we call this coordinate the radius. The 0-coordinate is 
the angle, which we define in the same way we related angles to points on the unit 
circle. 


We can also use a negative radius, which means “go in the direction opposite the 
one indicated by 0.” For example, we could express A as (—3,210°), which tells us 
that A is 3 units in the direction opposite 210°, which is the direction of 30°. Similarly, the polar coordinates (r, 0) 
and (—r, 0 + 180°) always refer to the same point. We can also use a negative angle, so point B at right could be 
expressed as (2, —120°). 
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If (x, y) in rectangular coordinates is the same point as (r, 0) in polar coordinates, then 


ox = 76080, y=rsin6, Paxrty’, tano = 2, 


where tan @ = * holds only if x + 0. If x = 0 and y + 0, then cos 0 = 0 and ?’ = y’. If x = y = 0, then r = 0 and 0 
can be anything. 


Just as we use the horizontal and vertical coordinates (x,y) to identify the location of a point in the two- 
dimensional Cartesian plane, we can identify the location of a point in three-dimensional space with three 
coordinates (x,y,z). We call the plane in which z = 0 the xy-plane. Points on this plane are identified by the 
coordinates (x, y, 0), where the x and y have the horizontal and vertical meanings they have in the Cartesian plane. 
For other points (x, y, z) in space, the z-coordinate indicates how far above (z > 0) or below (z < 0) the xy-plane the 
point is. The x- and y-coordinates of the point are the same as those of the point in the z = 0 plane that is directly 
above or below the point (x, y, Z). 


Important: The distance between the points (x1, y1, z1) and (x2, y2,Z2) is 


y (x2 - x1)? aP (y2 a E e a) 


There are two commonly used coordinate systems in three dimensions that use angles among their coordinates. 
In cylindrical coordinates, the x- and y-coordinates are replaced with r- and -coordinates such that the point in the 
Cartesian plane with rectangular coordinates (x, y) has polar coordinates (r, 0). The third cylindrical coordinate 
is the same as the z-coordinate from rectangular coordinates. A depiction of the point (r,0,z) in cylindrical 
coordinates is shown at left below. When expressing a point in polar coordinates, the radius always comes first, 
the angle second, and the z-coordinate last. 


If the cylindrical coordinates (r,9,z) represent the same point as the rectangular coordinates (x, y,z), then 
x = rcos@ and y = rsin@. As with polar coordinates, we usually choose r to be nonzero and 0 such that 
0 < O < 2n. However, r can be negative and @ can be outside the interval [0,27), again just as with polar 
coordinates. 


Figure 5.3: Cylindrical Coordinates Figure 5.4: Spherical Coordinates 


The spherical coordinates of a point are typically denoted as (p, 0, @), The p-coordinate is the distance from the 
P to the origin. The 6-coordinate is essentially the same as the angle in cylindrical coordinates. The @-coordinate 
is the angle between OP and the positive z-axis, as shown. We typically choose the spherical coordinates such 
that p > 0,0 < 0 < 27, and 0 < @ < n. The point with spherical coordinates (p, 0, ) has rectangular coordinates 
x= psingcos 0, y = psin ọsin0, z= pcosġ, 


and we have p° = x7 + 4? + 2”. 
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REVIEW PROBLEMS 


Things To Watch Out For! 


WARNING!! 
at 


of x and y, we only know that the graph of the parametric equations 
is part of the graph of the non-parametric equation. To show that the 
graph of the parametric equations is the same as the graph of the non- | 
parametric equation, we must show that every point in the graph of the 
non-parametric equation is in the graph of the parametric equations. 
We can usually do so by showing that for each point (x1, y1) on the 
graph of the non-parametric equation, there is a value of t such that 


x = f(t), yı = g(t). 


The graph of parametric equations is not necessarily the same as that 
j of a non-parametric equation satisfied by the parametric equations. 
Sometimes the parametric equations have restrictions that limit the 
j 
j 
l} 


graph of the parametric equations to only a portion of the graph of the 
non-parametric equation. | 


e When reading about spherical coordinates in another source, make 
sure you know what each variable stands for and what the restrictions 
oneach are. While most sources will define the two angles in spherical 
coordinates in essentially the same way we have here, some will use 
different variables than we have. Even more confusing, some will use 
the same variables, but reverse their meanings. Furthermore, some 
will restrict p to be nonnegative, and some will use r rather than p. 


5.25 Find the graph of the parametric equations x = 3a — 1, y = 2a? —a. 


REVIEW PROBLEMS 


5.26 Find parametric equations for x and y such that x? + 4x + y? — 2y = 11. 
5.27 Find the graph of the parametric equations x = (sin f)(cos t), y = 2 cos? 2t. 
5.28 Describe the curve that results from graphing the parametric equations x = złp, Y = pp- 


5.29 Convert each of the following rectangular coordinates to polar coordinates. You may use a calculator on 
this problem. Approximate angles to the nearest hundredth of a radian. 


(mae?) (b) (6 V3, -6 v2) (c)  (-5,8) 
5.30 Convert each of the following polar coordinates to rectangular coordinates. 


(a) (5,0°) (b) (—4,210°) (c) e= 


5.31 Describe the graph of the equation 7? = 3r cos 0 — 5r sin 0 — 2. 


5.32 
(a) What point results from rotating (4,5) by 60° clockwise about the origin? 
(b) What point results from rotating (4,5) by 60° clockwise about (2,5)? 
(c)* What point results from rotating (4,5) by 60° clockwise about (—2, 7)? 
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5.33 Let point M be the midpoint of the segment with endpoints (r1, 01) and (r2, 02) in polar coordinates. 


(a) Must it always be possible to express M in polar coordinates as (242 nt ,0) for some angle 0? 


(b) Must it always be possible to express M in polar coordinates as (7, +02 ) for some radius r? 
5.34 What symmetry must the graph of f(r, 0) = 0 have if: 

(a) f(r,0) = f(-r, 0) for all r and 0. 

(b) f(r,0) = f(r,—0) for all r and 0. 
5.35 Express each of the following rectangular coordinates in cylindrical and spherical coordinates. 


(a) (2, =2,2 V2) (b) (V3,3,-2 V3) 


(a) Convert the cylindrical coordinates (6, % ae -1) to rectangular coordinates. 
(b) Convert the spherical coordinates (12, F, sn) to rectangular coordinates. 
5.37 Describe the graph of all points in space such that y? + 2? = 9. 
5.38 What is the graph of r + z = 6 in cylindrical coordinates? 
5.39 Describe the graphs of each ofthe following equations: 
(a) r= 5 in cylindrical coordinates. (c) 0 = -5 in cylindrical coordinates. 
(b) p = 3 in spherical coordinates. (d) = an i in spherical coordinates. 
5.40 Describe the surface that is the graph of all points with cylindrical coordinates (r, 0, z) such that 7? + z? = 2z. 
5.41 Describe the surface that consists of all points with spherical coordinates (p, 0, ) such that p = 6 cos @. 
5.42 Describe the graph of p = = ay in spherical coordinates. 


(a) What geometric transformation maps any point with spherical coordinates (p, 0, œ) to a point with spherical 
coordinates (p, 0 + 71, Q)? 


(b) What geometric transformation maps any point with spherical coordinates (p, 0, @) to a point with spherical 
coordinates (p, 0, n — @)? 


Challenge Problems 


(a) Graphr = sin(26). 
(b) Graphr = sin(36). 
(c) Graphr = sin(46). 
(d) Graph r = sin(56). ` 


(e)x Generalize your results from the first four parts: describe the graph of r = sin(k@) for any integer k. 
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5.45 


(a) Find parametric equations for x and y whose graph is a circle with center (h, k) and radius r. 


(b) Find parametric equations for x, y, and z whose graph is a sphere with center (a,b,c) and radius p. (You 
will need two parameters.) 


(c) Find parametric equations for x, y, and z whose graph is a cylinder with radius r and with the x-axis as its 
axis. (You will need two parameters.) 


5.46 Find parametric equations whose graph is the portion of the curve x? — 4x + y? — 2y + 1 = 0 that is above the 
x-axis. Hints: 177, 107 


5.47 Let (x, y) be a point that does not lie on the negative x-axis, and let (r, 0) be the point’s polar coordinates, 
where r > 0 and -rr < 0 < n. Show that 0 = 2 arctan ——=. Hints: 66 ` 

x+ yH 
5.48 Find the smallest distance between the point (1, 2, 3) and a point on the graph of the parametric equations 
X= c= yt 2 = 3 + A 


5.49 The witch of Agnesi, named after mathematician Maria Gaetana Agnesi, 
is a curve that is defined as follows. Let O be the origin, let T be the point (0,1), 
and let / be the line through T parallel to the x-axis. Let C be the circle centered at 
(0, 1) with radius Ł. For any point P on the circle besides O and T, we draw a ray 
from O through P. Let this ray intersect £ at point X. We then draw the altitude 
from X to the line through P parallel to the x-axis. The foot of this altitude, point 
A, is on the witch curve. When we trace out the resulting points A for all possible 
P, and include point T, we get the witch curve, which is dotted in the diagram. 


(a) Find a parameterization for the witch curve. Hints: 23 
(b) Find a function f such that the witch curve is the graph of the function y = f(x). 


5.50x A sphere with radius 2 centered at the origin is painted red. A hole with radius 1 is drilled through the 
sphere along the x-axis. Find a parameterization for the red surface that remains. Hints: 244, 10, 235 


5.51x Coin A with radius 1 is placed on the Cartesian plane so that its center is at the origin. Coin B with radius 
lis placed on the Cartesian plane so that its center is at (0,2), which means that the two coins are tangent at (0, 1). 
Suppose the point on 8 that is initially touching coin A is painted red. We then roll coin 8 counterclockwise 
around coin A such that the two always remain touching, and coin A never moves. 


(a) Find parametric equations for the path of the red point. Hints: 148, 117, 8 


(b) During a full revolution of coin B around coin A, how many revolutions does the red point make around 
the center of coin 8? (The point completes a full revolution about the center of coin B whenever it is again 
directly below the center of 8.) 


(c) Suppose the radius of coin A is n, where n is a positive integer, and that coin $ has radius 1 and is initially 
centered at (0,1 + 1). Answer the questions of part (a) and part (b) if we again roll coin B about the 
circumference of coin A. 
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alte te 


There can be very little of present-day science and technology that is not dependent on complex numbers in one way or 
another. — Keith Devlin 


CHAPTER le ——— 


Basics of Complex Numbers 


The numbers we have used so far in this book are real numbers. These are the numbers that we can plot on the 
number line. One characteristic of a real number is that its square is nonnegative. In particular, there is no real 
number whose square is —1. 


But imagine that such a number exists: that there is some number whose square is —1. This leap brings us to 
a rich area of mathematics that we study in the next three chapters. We use the symbol i to represent a number 
whose square is —1: 
P=- 


A complex number is a number of the form a + bi, where a and b are real numbers and = —1. We call a the 
real part of a + bi and we call b the imaginary part. If b + 0, then a + bi is nonreal. If a = 0, then the complex 
number a + bi is called an imaginary number. (Some sources refer to imaginary numbers as “pure imaginary.”) 
The number 0 is both real and imaginary, and all real numbers and all imaginary numbers are complex numbers. 
Just as we use the symbol R to refer to “all real numbers,” we use the symbol C to refer to “all complex numbers.” 


6.1 Complex Number Arithmetic 


In this section, we cover the basic arithmetic of complex numbers. To add two complex numbers, we add their 
real parts and add their imaginary parts. So, the sum of the complex numbers a + bi and c + di as (a + c) + (b + dìi. 
We define the product of two complex numbers such that multiplication is distributive. So, we have 
(a + bi)(c + di) = a(c + di) + bi(c + di) 
= ac + adi + bci + bd? 
= ac + adi + bci — bd 
= (ac — bd) + (ad + bc)i. 
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J Probiems il 


Problem 6.1: Find all values of z such that z2 + 81 = 0. 


Problem 6.2: Evaluate 2. 


Problem 6.3: Let w = 3 — 2i and z = 2 + 5i. Express each of the following as a single complex number: 


(a) w+z (c) wz 
(b) w-3z (d) 2w?z-—3wz? 


Problem 6.4: Show that complex numbers are commutative under addition and multiplication. In other words, 
explain why w + z = z + w and wz = zw for all complex numbers w and z. 


Problem 6.5: 
(a) Find all real values of k such that the product (3 + 2i)(3 + ki) is a real number. 


(b) Letz =a + bi, where a and b are real numbers, and let Z = a — bi. Prove that z - Z is real. 
= 52; 


2 
Problem 6.6: Write 349) 284 complex number. In other words, find real numbers a and b such that oo =a+bi. 


Problem 6.7: The multiplicative inverse of a number z is the number w such that wz = 1. For example, 0.25 is 
the multiplicative inverse of 4. Show that any nonzero complex number a + bi has a multiplicative inverse, and 
express that inverse in terms of a and b. 


— 5 
Problem 6.8: Find all complex numbers z such that Z 7 - 28%, 


| Problem 6.1: Find all values of z such that z? + 81 = 0. ; 


Solution for Problem 6.1: Isolating z gives z? = —81. We might immediately see that our solutions are z = +91, 


because i? = —1 and both 9? and (—9)* equal 81. We could also “take the square root” of both sides of z? = -81 
and write z = + V-81, where V-81 is a shorthand for i V81. So, we have z = +i V81 = +9i. Another way to think 
about this latter approach is to think of —81 as 97”, since i? = —1. Therefore, we have z? = (9i)*, from which we 


have z = +97. O 


We have to be careful about using notation like V-81. The real-valued 


function f(x) = yx is only defined for x > 0. When we write V-81 as an 
intermediate step to solving an equation like z? = —81, we should keep in 


mind that we mean i V81. Ss 


WARNING!! 
ry 


| Problem 6.2: Evaluate i2, 


Solution for Problem 6.2: We start by experimenting with smaller powers of i, hoping to find a pattern. We have 
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Therefore, we see that: 


Important: The powers of i repeat in cycles of 4: 


5 


hoi ot Ul 


Because 2009 leaves a remainder of 1 when divided by 4, we have i7°% = (4507+ = (#7) = į, g 


Problem 6.3: Let w = 3 — 2i and z = 2 + 5i. Express each of the following as a single complex number: 


(a) wtz (c) wz 


(b) w-3z i (d) 2w*z —3wz* 


Solution for Problem 6.3: 
(a) To add complex numbers, we add the real parts and add the imaginary parts: 
w 2 = (O 21) (24 51) = G+ 2) (21 251) =o oi. 


(b) We have 
w — 3z = 3 — 2i — 3(2 + 5i) = 3 — 2i — 6 — 15i = -3 — 17i. 


(c) We multiply complex numbers using the distributive property, just as we do when we multiply binomials: 


(3 — 21)(2 + 5i) = 3(2 + 5i) — 2i(2 + 5i) = 6 + 15i — 4i — 10 = 6 + 11i -10(-1) = 16 + 11i. 


(d) We save ourselves a little bit of work by factoring the expression first: 2wW°z — 3wz? = wz(2w — 3z). In the 
previous part, we found wz = 16 + 11i. We also have 


2w -3z = 2(3 — 2i) — 3(2 + 5i) = 6 — 4i — 6 — 15i = —19i, 


SO 
wz(2w — 3z) = (16 + 11i)(—19i) = -304i — 209} = 209 — 304i. 


Sidenote: 


We frequently use w and z as variables to represent complex numbers, rather than 
the x and y that are typically used in problems about real numbers. That said, the | 
variable x is so common that you'll see many equations in terms of x that :have 

__nonreal solutions. _ 
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Problem 6.4: Show that complex numbers are commutative under addition and multiplication. In other words, 


explain why w + z = z+ w and wz = zw for all complex numbers w and z. 


Solution for Problem 6.4: We let w =a + bi and z = c + di, and we have 
w +z = (a + bi) + (c + di) = (a + c) + (b + dji = (c +a) + (d + b)i =z +w. 
We also have 
wz = (a + bi)(c + di) = (ac — bd) + (ad + bc)i, 
zw = (c + di)(a + bi) = (ca — db) + (cb + da)i = (ac — bd) + (ad + bc)i, 
so wz = zw. D 


We can similarly show that complex numbers are associative under addition and multiplication, and that 
multiplication of complex numbers is distributive. 


Important: For any complex numbers v, w, and z, we have: 


e w+Zz=Z+ wand wz = zw 


è v+ (w +z) = (v + w) +z and (vw)z = v(wz) 


© v(w +z) =vw t+ vz 

There’s not really anything new here; these are the same properties that you have used for real numbers 
for years. As exemplified in Problem 6.4, we can prove these properties for complex numbers by using the 
corresponding properties for real numbers together with the definitions of addition and multiplication of complex 
numbers. 


| Problem 6.5: 


| (a) Find all values of k such that the product (3 + 2i)(3 + ki) is a real number. 


(b) Letz =a + bi, where a and b are real numbers, and let Z = a — bi. Prove that z - Z is real. 


Solution for Problem 6.5: 
(a) Expanding the given product, we have (3 + 2i)(3 + ki) = 9 — 2k + (6 + 3k)i. The result is only a real number 
when 6 + 3k = 0, so k = —2. 


(b) In part (a) we saw that the product of the nonreal numbers 3 + 2i and 3 — 2i is real. Notice that the only 
difference between these two numbers is the sign of the imaginary part. Let’s see if we can generalize this 
observation by checking whether or not the product (a + bi)(a — bi) is a real number. We find 


(a + bi)(a — bi) = a(a — bi) + bi(a — bi) = a? — abi + abi - PÊ =a? +b, 


which is real because a and b are real. 


Definition: Ifz = a+ bi, thena- bíi is called the conjugate of z, denoted Z. So, we have a + bi = a—bi. Together, 
a nonreal number and its conjugate are referred to as a conjugate pair. 


Important: The product of a complex number and its conjugate is a real number. 
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Note that in order to prove this fact in Problem 6.5, we wrote a complex number in terms of its real and 
imaginary parts by writing z = a + bi. We also used this strategy in Problem 6.4. 


‘Concept: If you are given a problem involving a general complex number z, it may help to. 


Q= ~~ write z in the forma + bi, and then express the problem in terms of a and b. 


We've covered addition, subtraction, and multiplication of complex numbers. Let’s put conjugates to work by 
turning to division. 


= 
Problem 6.6: Write 3 = as a complex number. In other words, find real numbers a and b such that 


2-1 : 
340) 72th 


3-2i 


Solution for Problem 6.6: Knowing that (3 + 2i)(3 — 2i) is real, we multiply the given fraction by 5=5;, 


1, to produce an equivalent fraction with a real denominator: 


which equals 


2-i 2-i 3-21 (2-i)@-2i) 6-41-3142? 4-7i 


ao re es (2G) oe 22 ~ B 
2=i_ 4 7, 
Therefore, we have 777; = e o o 


The fact that zZ is real for all complex numters z gives us a way to divide by a complex number. 


If w and z are complex numbers and z # 0, then we express the quotient w/z 
as a complex number by multiplying both the numerator and denominator by 
the conjugate of the denominator, 2: 


Since zz is always real, we can express wz/(zZ) as a complex number, which 
means we can write it in the form a + bi for some pair of real numbers a and b. 


Problem 6.7: The multiplicative inverse of a number z is the number w such that wz = 1. For example, 0.25 
is the multiplicative inverse of 4. Show that any nonzero complex number a + bi has a multiplicative inverse, 
and express that inverse in terms of a and b. 


Solution for Problem 6.7: We expect that the multiplicative inverse is simply 4, expressed as a complex number 
using the process we learned in Problem 6.6. If we let 


1 1 a=W a—bi 
Teo; een fo eae 


then m B40 
—bi)(a + bi) a+ 
ag Oe a ee 
This only fails if a? +b? = 0. Since a and b are real, their squares are nonnegative, and can only both be 0 if a = b = 0. 
Therefore, if a + bi is a nonzero complex number, then its multiplicative inverse is 4 a o 


WZ = 


Problem 6.8: Find all complex numbers z such that za = 9 = M 
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Solution for Problem 6.8: We get rid of the fraction by multiplying both sides of the equation by the denominator 
of the fraction. This gives 


3z — 5i = (2z + 3)(3 — 21). 
Expanding the right side gives 

zmo — 07 7 OF, 
Putting all the terms with z on the left and the terms without z on the right gives —3z + 4iz = 9 — i. Factoring z out 
of the left side gives z(—3 + 4i) = 9 — i. Dividing both sides by —3 + 4i gives 


9-i  9-i -3-4i -31-33i 31 33 


a 22 Tee Sa a 


Z 


o 


|| Exercises Ea 
6.1.1 Find all z such that 4z2 + 12 = 0. 


6.1.2 Leta = 3 + 4i and b = 12 - 5i. Compute each of the following: 
(a) a-b (c) a*+3a+2 


(b) ab (d) 


6.1.3 Write each of the following in the form a + bi, where a and b are real: 
(a) 2i+7-2i 

a 

Borst) i l 2i 


E e 


6.1.4 Let f(x) = T Find each of the following: 


IO Sr 
(a) fO 
(b) f(-7) 
(c) fli-1) 
6.1.5 What is (i —i-')-!? (Source: AHSME) 


6.1.6 Solve each of the following equations for z: 


z4+31 
= 2 
(a) ER 
1+2i 
= act Si 
i: 3Z 
z ; 
—.=10-4 
(c) 32+ 555 1 


6.1.7 Use the definitions of complex number addition and complex number multiplication to prove that complex 
numbers are distributive. In other words, prove that w(z; + Z2) = wz + wz2 for any complex numbers w, z1, and zz. 


6.1.8 Let S$ =i%+i-", where n is an integer. What are the possible values of S? (Source: AHSME) 


ee a ee 
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6.2 The Complex Plane 


A complex number can be represented as a point in the complex plane. 
Like the Cartesian plane, the complex plane has two axes: a horizontal 
real axis for the real part, and a vertical imaginary axis for the imaginary 
part. The real axis of the complex plane is labeled Re and the imaginary 
axis is labeled Im. Plotting the complex number x + yi on the complex 
plane is the same as plotting the point (x, y) in the Cartesian plane. The 
figure at right shows the numbers 1 + 21, -5 — 31, and 2 — 4i plotted in the 
complex plane. 


Just as the point (0,0) is called the origin on the Cartesian plane, the 
point that represents 0 on the complex plane is also called the origin of 
the complex plane. 


I) Probiems i 


Problem 6.9: 
(a) Letw=2+4 31. Plot w, W, and ~w on the complex plane. 


(b) For any complex number z, how is z related to Z on the complex plane? How is z related to -z on the 
complex plane? 


Problem 6.10: 
(a) Letw=2+4iandz = 3-21. Plot w, z, and w + z on the complex plane. Connect the origin to w, then 
connect w to w + z, then connect w + z to z, then connect z to the origin. Notice anything interesting? 


(b) Suppose w and z are nonzero complex numbers such that w/z is not real. Connect the origin to w, then 
connect w to w +z, then connect w + z to z, then connect z to the origin. What type of quadrilateral must 
result? Why? 


Problem 6.11: 
(a) What is the distance between 3 + 4i and the origin in the complex plane? 


(b) What is the distance between a + bi and the origin in the complex plane? 


Problem 6.12: Let |a + bil = Va? + b2. 
(a) Evaluate |5 — 12:| and |3 — 3il. 


Suppose w = 3 — 5i and z = —2+ 71. Find |w- z|. Find the distance between w and z on the complex plane. 
Notice anything interesting? 


For any complex numbers w and z, how is |w — z| related to the distance between w and z on the complex 
plane? 


Show that if |z| = 0, then z = 0. 


Problem 6.13: Describe the graphs of each of the following: 
(a) [z|=3 
(b) jz-4|=3 
(c) iz+5- 4il = 2V2 


(d) |z- wļ| = c, where w is a complex number and c is real. 


6.2. THE COMPLEX PLANE 


Problem 6.9: For any complex number z, how is z related to Z on the complex plane? How is z related to -z 


on the complex plane? 


Solution for Problem 6.9: We start by experimenting with a specific value of z. 


| Concept: Trying specific examples can be a good way to discover general relationships. 


Suppose z = 2 + 3i. Then, we have Z = 2 — 3i and -z = —2 — 3i. We see that Z is Im 
the reflection of z over the real axis and —z is the reflection of Z over the imaginary . 
axis. Is this just a coincidence? 


To see if this is true in general, we let z = a + bi. Then, we have Z = a — bi 
and -z = —a— bi. So, z and Z have the same real part and opposite imaginary 
parts. Therefore, z and Z are reflections of each other over the real axis. Similarly, 
z and —z have the same imaginary part, but opposite real parts, so Z and —z are 
reflections of each other over the imaginary axis. 


But how are z and -z related? Because z = a + bi and —z = —a — bi, the points 
corresponding to z and -z are in exactly opposite directions from the origin. We also see that z and -=z are the 
same distance from the origin. Therefore, —z is the 180° rotation of z around the origin on the complex plane. 
(Note: We sometimes refer to the 180° rotation of a point A around a point P as the “reflection of point A through 
point P.”) O 


Important: Inthe complex plane, we have the following: 
e The point —z is a 180° rotation of point z about the origin. 


e The points z and Z are reflections of each other over the real axis. 


e The points Z and —z are reflections of each other over the imaginary axis. 


Don’t memorize these relationships! If you understand why they are true, they should become obvious to you. 


| Problem 6.10: Suppose w and z are nonzero complex numbers such that w/z is not a real constant. (In other 
words, there is no real number a such that w = az.) Connect the origin to w, then connect w to w + z, then f 
| connect w + z to z, then connect z to the origin. What type of quadrilateral must result? Why? 


Solution for Problem 6.10: Again, we start with an example. Suppose Im 
w = 2 + 4i and z = 3 — 2i, so w + z = 5 + 2i. In the diagram at right, we 
connect the points as suggested. We appear to have a parallelogram. se og 
; =- w+z=(24+3)+ (4-2) 

To see if this is always the case, we let w = a + bi and z = c + di. So, ee 


we have w +z = (a+c)+(b+d)i. If a + 0, then the slope between w 
and the origin is b/a, as is the slope between w + z and z. If a = 0, then 
both of these lines are vertical lines. In either case, these two sides of 
the quadrilateral are parallel. Similarly, the slopes of the other two sides 
are either both equal to d/c or they are both undefined (when both sides 
are vertical). So, these two sides are parallel as well. Therefore, both pairs of opposite sides of the quadrilateral 
are parallel, which means the quadrilateral is a parallelogram. O 
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Problem 6.11: 


(a) What is the distance between 3 + 4i and the origin in the complex plane? 


‘ (b) What is the distance between a + bi and the origin in the complex plane? 


Solution for Problem 6.11: 


(a) Finding the distance between 3 + 4i and the origin on the complex plane is the 
same as finding the distance between (3,4) and (0,0) on the Cartesian plane. 
We can use the distance formula, or build a right triangle as shown at right, to 


find that the desired distance is /(3 — 0)? + (4— 0)? = 5. 
(b) The distance between a + bi and 0 on the complex plane is the same as the 
distance between (a, b) and (0,0) on the Cartesian plane. So, the desired distance 


is Va2 + b2. 


Just as we use |x| to denote the distance between a real number x and 0 on the number line, we use |z| to represent 
the distance between a complex number z and the origin on the complex plane. We call |z| the magnitude of z. 
The magnitude of a complex number is sometimes called the norm of the number. 

jl 


Important: - The magnitude of z, denoted by izl, equals the distance from z to the origin o 
Vv the complex plane. If z = a + bi, we have 


lel = Va? +b. 


Problem 6.12: 
(a) Evaluate |5 — 12:| and |3 — 3il. 
(b) Suppose w = 3 — 5i and z = -2 + 7i. Find |w — z|. Find the distance between w and z on the complex 
plane. Notice anything interesting? 
(c) For any complex numbers w and z, how is |w — z| related to the distance between w and z on the complex 
plane? 
Show that if |z| = 0, then z = 0. 


Solution for Problem 6.12: 


(a) Wehave [5 — 12i| = 4/52 + (12)? = 13 and [3 — 3i] = 4/32 + (-3)2 = 3 V2. 


(b) We have w—z = 5—12i, and in the previous part we found that |5— 12i| = 13. 
We plot w and z in the diagram at right. We see that the horizontal distance 
between them is 5 and the vertical distance between them is 12, so the 
segment connecting the two points is the hypotenuse of a right triangle with 
legs of lengths 5 and 12. Therefore, the Pythagorean Theorem tells us that 


the distance between w and z on the complex plane is V5? + 12? = 13. 


Notice that the distance between w and z equals |w — z|. Is this a coinci- 
dence? 


(c) No, it’s not a coincidence. Suppose w = a + bi and z = c + di. Finding the distance between w and z in the 
complex plane is the same as finding the distance between (a, b) and (c,d) on the Cartesian plane. So, the 


distance between w and zis 4/ (a — c} + (b —d)?. 
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Because w — z = (a + bi) — (c + di) = (a — c) + (b—d)i, we have 


jw- z| = \/(a—c)* + (b - d}. 


This tells us that: 


Important: The distance between complex numbers w and z on the complex plane is |w- z]. 


(d) Geometrically, the equation |z — 0| = 0 tells us that the distance from z to 0 in the complex plane is 0. 
Therefore, we must have z = 0. 


Algebraically, if we let z = a+ bi, where a and bare real, then the equation |z| = 0 tells us that Va? + b? = 0. 
Squaring both sides gives a? + b? = 0. Since a and b are real, their squares are nonnegative. So, we only 
have a* + b? = 0 if a = b? = 0, which means a = b = 0. Therefore, we again have z = 0. 


Problem 6.13: Describe the graphs of each of the following: 
(a) |z|=3 
(b) |z-4|=3 
(c) |z+5-4i| = 2V2 


(d) |z- w| = c, where w is a complex number and c is real. 


Solution for Problem 6.13: 


(a) If |z| = 3, then the distance from z to the origin of the complex plane is 3. Therefore, z must be on the circle 
centered at the origin with radius 3. Moreover, every point z on this circle satisfies the equation |z| = 3, 
because every point on this circle is 3 units from the origin. Therefore, the graph |z| = 3 describes a circle of 
radius 3 centered at the origin. 


(b) The equation |z — 4| = 3 tells us that the distance between z and 4 in the complex plane is 3, so the graph is 
a circle of radius 3 centered at the point that represents 4. Since 4 = 4 + 0i, the center of the circle is on the 
real axis, 4 units to the right of the origin. 

(c) Writing z +5 — 4i as z — (—5 + 4i), we see that the graph of the equation Im 
Iz — (-5 + 4i)| = 2V2 consists of all points that are a distance of 2 Vrome ee 
the point —5 + 4i. Therefore, the graph is a circle of radius 2V2 centered 
at —5 + 4i. 

(d) Generalizing our work from parts (a)-(c), we see that the graph of the 
equation |z — w| = c consists of all points that are a distance c from the 
complex number w. In other words, the graph is a circle of radius c 
centered at w (shown at right for w = 2 + i and c = 6). 


Exercises 


6.2.1 Plot each of the following in the complex plane: 
(a) 44+71 (b) -6-2i (c) (3+2(-2 + 51) 
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6.2.2 Find the magnitude of each of the following complex numbers: 


(a) 24-7i 
(b) 24+2i-V3 
(c) (1+ 21)(2 +2) 


6.2.3 Let w = 3 + 5i and z = 12 + 2i. Find the area of the convex quadrilateral in the complex plane that has 
vertices w, z, W, and Z. 


6.2.4 Find the distance between the points 4 + 7i and —3 — 17i in the complex plane. 
6.2.5 Show that the midpoint of the segment connecting z; and zz on the complex plane is (z1 + z2)/2. 


6.2.6 

142i 
2i 
6+11i 
11 +6i 
(c) Notice anything interesting? Can you generalize your observations from the first two parts? 


(a) Find the magnitude of 


(b) Find the magnitude of . (You can use a calculator for this part.) 


6.2.7 Four complex numbers lie at the vertices of a square in the complex plane. Three of the numbers are 1 + 21, 
—2 + i and —1 — 2i. What is the fourth number? (Source: AMC 12) 


6.3 Real and Imaginary Parts 


Not only can we produce a geometric representation of complex numbers by considering the real and imaginary 
parts of the numbers, but we can also use the real and imaginary parts of a complex numbers to prove important 
relationships and solve a variety of problems. 


We often use Re(z) to refer to the real part of the complex number z, and we use Im(z) to refer to the imaginary 
part of z. So, for example Re(2 — 4i) = 2 and Im(2 — 41) = —4. 


|| Problems i 
zeaz 


Problem 6.14: Show that Re(z) = = and Im(z) = ae 


Problem 6.15: Let z and w be complex numbers. 
(a) Show thatz +w =Z +0. 
(b) Show that zw =Z . ù. 


Problem 6.16: 


(a) Show that Z = z for all complex numbers z. 
(b) Show that Z = z if and only if z is real. 
(c) Show that Z = -z if and only if z is imaginary. 


Problem 6.17: Solve the equation z + 22 = 6 — 4i for z. . 
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Problem 6.18: 


(a) Prove that zZ = |z|? for all complex numbers z. 


(b) Prove that |zzw| = Įz||w] for all complex numbers z and w. 


Problem 6.19: Suppose |z + w| = 2 and |z| = |z| = 3. Find |z — wl. 


Problem 6.20: In this problem, we find all complex numbers z such that z? = 21 — 201. 
(a) Let z =a + bi in the given equation. Find a system of equations involving a and b. 


(b) Solve for b in terms of a in one of the equations, and substitute the expression you found for b into the 
other equation. 


(c) Solve the equation you formed in part (b) for all possible values of a. 


Find all complex numbers z such that z? = 21 — 20i. 


Problem 6.21: Find and graph all z such that |z — 3] = |z + 2il. 


Problem 6.22: In this problem, we find |z| if the complex number z satisfies z + |z| = 2 + 8i. 
(a) Letr = |z|. Express z in terms ofr. 


(b) What is the real part of z? What is the imaginary part of z? Find |z|. 


(c) Solve the problem again by letting z = a + bi. 
(Source: AHSME) 


| LF _s 
| Problem 6.14: Show that Re(z) = -= and Im(z) = = 


Solution for Problem 6.14: Letz = a + bi so that Z = a — bi. Then 


z+Z _(a+bi)+(a-bi)_ 2a _ 


2 2 an 

z-Z_(a+bi)-(a-bi) _2bi _, 

2i 2i E 
So, Re(z) = a = === and Im@) = b = ŻA. o 


Important: For any complex number z, we have 


Z+Z 


Re(z) = 


Problem 6.15: Let z and w be complex numbers. >- 
(a) Show thatz+w=Z+W. 


(b) Show that Zw =Z-W. 


Solution for Problem 6.15: Letz =a + bi and w = c + di, where a, b, c, and d are real numbers. 


203 


CHAPTER 6. BASICS OF COMPLEX NUMBERS 


(a) Sincez+w=atbit+c+di=(a+c)+(b+d)i, the conjugate of z + w is 
Z+w =(a+c)+(b+ ai =(a+c)— (b+ di. 
Similarly, the conjugate of z is Z = a — bi, and the conjugate of w is w = c — di, and their sum is 
a — bi + c — di = (a + c) — (b + d)i. 


So, we havez +w =Z + W. 


(b) We have 
zw = (a + bi)(c + di) = ac + adi + bci + bd? = (ac — bd) + (ad + bc)i, 


so the conjugate of zw is zw = (ac — bd) — (ad + bc)i. Also, we have 


Z-T = (a — bi)(c — di) = ac — adi — bci + bdi2 = (ac — bd) — (ad + be)i, 


Z+w 


We'll be seeing these two very important relationships again when we study nonreal roots of polynomials in 
Section 6.4. 


Problem 6.16: 


(a) Show that Z = z for all complex numbers z. 


(b) Show that z = z if and only if z is real. 
(c) Show that Z = —z if and only if z is imaginary. 


Solution for Problem 6.16: Letz = a + bi, where a and b are real numbers. 


(a) Since zZ = a — bi, wehavez =a-—bi=a+bi=z. 
(b) Since z = a — bi, the equation z = z becomes a — bi = a + bi. Subtracting a from both sides gives —bi = bi. 


Therefore, we have 2bi = 0, so b = 0. This gives us z =a + bi =a+0i = q, so z is real. This shows that Z = z 
only if z is real. 


We're not finished with our proof! We have only shown that when Z = z, then z is real. (In other words, 
we have Z = z only if z is real.) We have not shown that the equation Z = z is true for all real numbers. In 
other words, we haven't shown that Z = z if z is real. 


To show that Z = z whenever z is real, we let z = a, where a is a real number. We then have Z = 4 = a, so 
Zz = z if z is real. 


Suppose A and B are two mathematical statements. The statement “A if and 


Important: 
| only if B” means both of the following: 


e IfA is true, then B is also true. 
© If B is true, then A is also true. 


Therefore, as we just saw, proving an “if and only if” statement requires proving two statements, not 
just one. 
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(c) Since Z = a — bi, the equation Z = -z becomes a — bi = —a — bi. Adding bi to both sides gives a = —a, so a = 0. 
Therefore, we have z = a + bi = bi, so z is imaginary. This shows that Z = —z only if z is imaginary. 
As with part (b), this is an “if and only if” statement. We must also show that Z = —z if z is imaginary. 
Letting z = bi, we have Z = bi = —bi = —z, as desired. 


E 


Problem 6.17: Solve z + 2z = 6 — 41 for z. 7 = j 


Solution for Problem 6.17: Letting z = a + bi, for some real numbers a and b, the given equation becomes 
(a + bi) + 2(a — bi) = 6 — 41. Simplifying the left side gives 3a — bi = 6 — 4i. Since two complex numbers are equal 
if and only if their real parts are equal and their imaginary parts are equal, we have 3a = 6 and —b = —4. Solving 
these two equations gives a = 2 and b = 4, so z = a + bi = 2 + 4i. O 


Problem 6.18: 


(a) Prove that zz = |z|? for all complex numbers zA 


(b) Prove that |zw] = |z||w| for all complex numbers z and w. 


Solution for Problem 6.18: Letz = a + bi and w = c + di for real numbers a, b,c, and d. 


2 
(a) Since Z = a — bi, we have zZ = (a + bi)(a — bi) = a + b? = (va? + b2) = ja + bil? = |z. 
(b) First, we note that zw = (a + bi)(c + di) = ac — bd + (ad + bc)i. So, we have 


Izw| = lac — bd + (ad + bc) il = (ac — bd)? + (ad + be)? 
= Va — Qabed + b2d2 + a2d2 + 2abcd + b2c2 
= Vae + a2d2 + b2c2 + a2 
= J (2 + B)(c2 + d3) 
r GE T 
= Ja + bille + dil = zlil. 


We could also have used part (a) together with a little cleverness to solve part (b). From part (a), we 


have 
lzw|* = zw(Z0) = zwz- W = (zzZ)(ww) = |zP lwl. 


Since all the magnitudes are nonnegative numbers, we can take the positive square root of both sides to get 
|zzo| = [z|lzI. 


Important: - The relationship zÊ = = zZ can be very helpful i in problems involving the magni- 
Y tudes of complex numbers. : 


Problem 6.19: Suppose |z + w| = 2 and [z| = |w| = 3. Find |z — wl. 
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Solution for Problem 6.19: There’s no clear way to express |z — w| in terms of |z + wl, |z|, and |w], so we write it in 
terms of z, w, Z, and W: 


|z- wh = (z - w)(z - w) = (z — w)(Z — W) = zZ — zŪ — wz + ww. 


Notice that we work with |z — w|? rather than |z — w| to avoid square roots. 


Concept: It’s sometimes easier to work with the square of the magnitude of a complex 


Q=» ~~ number than with the magnitude itself. 


From |z| = |w| = 3, we find that zz = |z|? = 9 and ww = |w|* = 9, so we have 


|z — w|? = 22 — ZW — wz + WW = 18 — 2 — WZ. 


We have one more magnitude to explore. As with the others, we start by looking at its square. We have 
Iz + wl = 4 and 
lz + wÊ = (z + wz +W) = zZ + zW + WZ + WW = 9 + z0 +w +9, 
so 18 + z@ + wz = 4, which means zW + wz = —14. Therefore, we have |z — w|? = 18 — (zW + wz) = 18 — (-14) = 32, 
from which we find |z - w| = 4 V2. 0 


| Problem 6.20: Find all complex numbers z such that z? = 21 — 20i. j 


Solution for Problem 6.20: We might start by taking the square root of both sides, so that z = + V21 — 20i. But what 
does V21 — 20i really mean? The real-valued function f(x) = yx is only defined for x > 0. Unfortunately, this 
doesn’t help us define the square root of a nonreal number. Instead of trying to define ¥21 — 201, let’s look for 
complex numbers whose squares equal 21 — 20i. 


Let z = a + bi, so we have (a + bi)* = 21 — 20i. Expanding the left side gives 
a —b + 2abi = 21 — 201. 


Since a and b are real, we can equate the real parts of both sides to get a” — b? = 21, and equate the imaginary parts 
to find 2ab = —20. Solving 2ab = —20 for b gives b = —10/a. Substituting this into a? — b? = 21 gives 


Multiplying this by a? gives us a* — 21a? — 100 = 0. Factoring gives (a? — 25)(a? + 4) = 0. However, a is real, so a? 
must be nonnegative. Therefore, we have a* = 25, so a = +5. Since b = -10/a we have b = -2 when a = 5, and 
b = 2 when a = —5. So, the two possible values of z are +(5 — 21). O 


Having found the two complex numbers whose squares equal 21 — 207, we can see some of the difficulty that 


arises from defining V21 — 20i. Which of the values 5 — 2i or —5 + 2i should we take to be the square root? We can’t 
simply call one “positive” and one “negative.” 


Due to these difficulties, we try to avoid using expressions like ¥21 — 20i, and instead think of the “numbers 
whose squares equal 21 — 207.” We may sometimes refer to these two numbers as “the square roots” of 21 — 20i, 
and may even refer to one or the other as “a square root” of 21 — 20i. But we generally won't choose one or the 


other to equal V21 — 20i the way we choose 5 to equal V25. 


We have these same difficulties when trying to define V-81. For example, which of 9i or —9i is “the” square 
root of —81? However, as we saw in Problem 6.1, we may occasionally stumble on expressions like V-81 when 
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solving equations like z? = —81. When this happens, the square root should always be preceded by +, as in 


z = Ł V-81. In other words, we are referring to both “square roots” of —81; that is, we are considering both 
numbers whose squares are —81. 


Problem 6.21: Find and graph all z such that |z — 3| = |z + 2il. 


Solution for Problem 6.21: Solution 1: Algebra. Letting z = a + bi, for real numbers 
a and b, the equation becomes |a + bi — 3| = |a + bi + 2i|, or 


(a — 3) + bil = ja + (b + 2)il. 


Applying the definition of the magnitude of a complex number to both sides of 
this equation, we have ž 


y (a - 3P +b? = ya? + (b + 2)2. 


Squaring both sides of this equation and expanding the squares of binomials gives 


a — 6a +9 +b =R? +b? +4b+4. 


Simplifying this equation gives 6a + 4b = 5, so our equation is 6 - Re(z) + 4 : Im(z) = 5. The graph of this equation 
is a line, shown at right above. We can write the equation 6 - Re(z) + 4: Im(z) = 5 in terms of z and Z by noting that 
Re(z) = (z+Z)/2 and Im(z) = (z—Z)/(21). Substituting these into 6: Re(z)+4-Im(z) = 5 gives us (3-—21)z+(3+2z = 5. 


We also could have tackled this problem algebraically by squaring |z — 3| = |z + 2i| to get |z — 3? = |z + 2i|?. We 
then use the fact that |w|? = ww for all complex numbers w, which gives us (z — 3)(z — 3) = (z + 2i)(z + 2i). Since 
zZ—3=Z-3 =Z-—3and (z+ 2i) =Z+ 2i = Z —2i, we have (z — 3)(Z — 3) = (z + 2i)(Z — 2i). Expanding both sides and 
rearranging gives (3 — 2i)z + (3 + 21)z = 5, as before. 


Solution 2: Geometric intuition. We rewrite the equation so that both magnitudes are of differences between 
complex numbers, |z — 3| = |z — (—21)|. The quantity on the left side of the equation is the distance from z to 3 in the 
complex plane. Similarly, the right side is the distance from z to —27 in the complex plane. Therefore, our equation 
tells us that z is equidistant from 3 and —2i in the complex plane. So, z must be on the perpendicular bisector of 
the segment that connects 3 and —27 in the complex plane. 


The slope of the line connecting 3 and —2i is 3, so the slope of its perpendicular bisector is —3. The perpendicular 


bisector passes through the midpoint of 3 and —2i, which is 45% = 3 — i. Using this point and the slope of the line, 


we can write a point-slope form of the line: 


Im(z) — (—1) = 3 (Ree) — >) : 


Rearranging this equation gives us 6 : Re(z) + 4: Im(z) = 5, as before. O 


The key to our second solution is using the geometric interpretations of |z — 3| and |z + 24l. 


_ Many complex number concepts have a strong geometric significance as well as 
an algebraic one, so if you have trouble understanding an algebraic relationship 
regarding complex numbers, try considering it in the complex plane. 


Concept: : 


Problem 6.22: The complex number z satisfies z+ |z] =2+ 8i. What is |z|? (Source: AHSME) 
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Solution for Problem 6.22: Solution 1: Keep your eye on the ball. We want |z|, and our equation involves |z|, so we let 
iz| = r. Then, we have z = 2 + 8i-|z| = (2—r) + 8i. Written this way, we can see that the complex number z has real 
part 2 —r and imaginary part 8. Therefore, we have 


Iz? = (2—r)? + 8? =r — 47 +68. 


We also have |z|* = r”, and combining this with our equation above gives r? = r* — 4r + 68. Solving this equation 
gives usr = 17, so |z| = 17. 


Solution 2: Let z = a + bi. Then, the given equation is 
a + bi + Va? + b? =2 + 8i. 


The only imaginary term on the left is bi, so we have b = 8. Equating the real parts of both sides then gives 
a + Va? + 64 = 2. Isolating the radical gives Va? + 64 = 2 — a and squaring both sides gives a? + 64 = 4 — 4a +a’. 


Solving this equation gives us a = —15. Because we squared the equation Va? + 64 = 2 — a, we must check if the 
solution is extraneous. It isn’t, so |z| = |-15 + 8] = 17. o 


a > 


6.3.1 Find |(10 + 247)(8 — 67)| without finding the product (10 + 247)(8 — 67). 
6.3.2 Show that |z| = |Z]. 
6.3.3 Solve 3z + 4Z = 12 — 5i for z. 
6.3.4 Solve each of the following equations: 
(ee 2 =) (b) 2=-5+12i (c) 2*7=24-10i 
6.3.5 Letz=3+41and w = 5 — 121. Evaluate the following expressions: 


1 z lz| 
() oma © || CE 


Compare your answers in parts (a) and (b), and compare your answers in parts (c) and (d). Notice anything inter- 
esting? (You should!) Prove that your observations will hold for any nonzero complex numbers. 


Z 


6.3.6 Find all complex numbers z such that z/Z is 


(a) areal number (b) animaginary number 


6.3.7 Two solutions of x* — 3x° + 5x? — 27x — 36 = 0 are pure imaginary numbers. Find these two solutions. 
(Source: ARML) Hints: 174 


6.4 Nonreal Roots of Polynomials 


We can sometimes find roots of a polynomial by factoring the polynomial. For example, to find the roots of the 
quadratic x? — 5x + 6, we factor the quadratic as (x — 2)(x — 3). Then, we have (x — 2)(x — 3) = 0 if and only ifx-2 =0 
or x — 3 = 0, which gives us the roots x = 2 and x = 3. So, we see that if we can factor a polynomial as the product 
of linear factors, then we can quickly find the roots of the polynomial, since each linear factor clearly has exactly 
one root. 


We run into a complication with the polynomial x* + 1. We can’t factor this as the product of two linear terms 
with real coefficients, and the polynomial doesn’t have any real roots. So, we introduced the idea of complex 
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numbers to describe the roots of x? + 1, writing them as x = i and x = ~i, where í is the number whose square is 
—1. Complex numbers therefore also allow us to factor x? + 1 as a product of linear factors: 


xX +1=(x-i(x+i). 


This might make you wonder if we need any more special types of numbers beyond complex numbers to 
factor more complicated polynomials as a product of linear factors. The answer turns out to be “no”; complex 
numbers are sufficient to factor any polynomial with complex coefficients. In order to see why, we’ll need the 
Fundamental Theorem of Algebra. 


The Fundamental Theorem of Algebra states that every nonconstant one- 
variable polynomial with complex coefficients has at least one complex root. 


Important: 


Unfortunately, to prove the Fundamental Theorem of Algebra, we need tools that are much more advanced 
than those we will study in this text. 


We'll also use the Factor Theorem and the Rational Root Theorem to study polynomials. 


The Factor Theorem states that for any complex number a, the binomial x — a 
is a factor of the polynomial f(x) if and only if f(a) = 0. 


Important: 


Important: The Rational Root Theorem states that if i is a rational root of 


1 


FQ) = One a T E 


where p, q, and all of the a; are integers and gcd(p,q) = 1, then ap is a multiple 


The Factor Theorem and the Rational Root Theorem are covered in detail in Art of Problem Solving’s Intermediate 
Algebra. 


Ls > 


Problem 6.23: Show that if complex numbers w and z satisfy wz = 0, then either w = 0 or z = 0. 


Problem 6.24: Consider the polynomial f(x) = a,x" + an-1X"7! +--+. + ao, where Ap, An-1, -- - „40 are complex 
numbers and a, # 0. 

(a) Show that there exist complex numbers 11,12,...,1n such that f(x) = ay(x — r1)(x = r2) +++ (x = fa). 

(b) Use part (a) to explain why f has exactly n complex roots, where the same number may appear multiple 
times among the roots. (For example, the polynomial x? has the three roots 0, 0, and 0.) 


Problem 6.25: Find the roots of the following polynomials: 
(a) f(x) =X -— 2x? + 3x -18 


(b) g(x) = 2x4 — 8x? — 3x* + 22x — 40 
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Problem 6.26: Let p(x) = anx” + dy—1x""} + +++ + a1xX + ao and q(x) = Amx” be polynomials with real coefficients. 
(a) Show that q(z) = q(Z) for all complex numbers z. 


(b) Show that pe) = p(z) for all complex numbers z. 
(c) Show that if p(z) = 0, then p(z) = 0. 


Problem 6.27: Find the monic polynomial f(x) of minimal degree such that f(2 — i) = f(1 + 4i) = 0 and all 
the coefficients of f(x) are real. (The degree of a polynomial a,x" + a,x"! + +--+ a9, with a, # 0, is n, and 
the polynomial is called monic if a, = 1. For example, x° + 3x* + 7 is a monic polynomial with degree 6, and 
3x° — x + 1 is a degree 5 polynomial that is not monic.) 


Problem 6.28: Consider the equation c4z‘ + ic3z* + cz? + ic1z + co = 0, where co, C1, Co, C3, and c4 are real constants. 
q 3 


(a) Find a function h such that substituting z = h(x) into the polynomial above produces a fourth degree 
polynomial in x with real coefficients. 


(b) If one solution for z to the original equation is a + bi, where a and b are real, then what value of x solves 
your equation from part (a)? 


(c) What other value of x must be a solution to your equation from (a)? 


(d) Prove that z = —a + bi is a solution to the original equation. 
(Source: AHSME) 


Problem 6.23: Show that if complex numbers w and z satisfy wz = 0, then either w = 0 or z = 0. 


Solution for Problem 6.23: If w is 0, then the “w = 0 or z = 0” condition is satisfied. If w is not zero, then we can 


divide both sides by w to get z = 2 = 0, and again the “w = 0 or z = 0” condition is satisfied. 0 


We can extend the result of Problem 6.23 to any number of complex numbers: if 21Z223-::Z, = 0, then at 
least one of the numbers 21,2Z2,...,Z, must be 0. This property is called the zero-product property. It’s the same 
property that we use for real numbers to solve a quadratic equation like x” — 5x + 6 = 0. We factor the equation 
as (x — 2)(x — 3) = 0, and then note that if the product (x — 2)(x — 3) is 0, then either x - 2 = 0 or x - 3 = 0. So, 
the solutions are x = 2 and x = 3. Of course, as we have seen, there are polynomials that cannot be factored as 
a product of linear terms with real coefficients. Let’s see what happens if we allow complex coefficients in the 
factors. 


Problem 6.24: Consider the polynomial f(x) = a,x” + An ax" | +--+ a9, where ay,4n-1,---,9 are complex 
numbers and a, # 0. 
(a) Show that there exist complex numbers r1, r2, ..., fn such that f(x) = a(x — 11)(x — r2) <+- (x — rn). 


(b) Use part (a) to explain why f has exactly n complex roots, where the same number may appear multiple 
times among the roots. (For example, the polynomial xè has the three roots 0, 0, and 0.) 


Solution for Problem 6.24: 


(a) The Fundamental Theorem of Algebra tells us that this polynomial! has a complex root, which we'll call r4. 
So, if we divide x — rı into f(x), the quotient is a polynomial g(x) with degree n — 1. By the Factor Theorem, 
we therefore have 

F(x) = (x — 11) 9(X) = (x — 11) (Dna) + bn-2xX"7? + +++ + bo) 


for some constants b,_1, b,_2,...,b9. Applying the Fundamental Theorem of Algebra to g(x) we know that 
it must have a complex root that we'll call r2. Dividing g(x) by x — r2 gives a degree n — 2 quotient. Again 
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by the Factor Theorem, we have 
F(x) = (x —11)(x — re)h(x) 


for some degree n — 2 polynomial h(x). Continuing in this manner, the quotient at each step has degree 1 
less than the previous quotient. Therefore, we can express f(x) as the product of a constant and n linear 
factors: 


F(X) = Gale = 1) 7a). 


If f(x) = 0, then product (x — 11)(x — r2): +- (x — rn) must equal 0. By the zero-product property, the product 
of several complex numbers equals 0 if and only if at least one of the numbers is 0. Therefore, we can only 
have f(x) = 0 if x —7; = 0 for some i. The only solution to x — r; = 0 is x = 7;, which is a complex number. 
This tells us that f has exactly n complex numbers as its roots. 


| Every polynomial of degree n has exactly n complex roots. (A number may 
appear multiple times among the roots of the polynomial. For example, the 
polynomial (x — 1)(x - 2° has the three complex roots 1, 2, and 2.) 


Important: 


Problem 6.25: Find the roots of the following polynomials: 
(a) f(x) =x - 2x? +3x-18 


(b) g(x) = 2x4 — 8X — 3x? + 22x — 40 


Solution for Problem 6.25: 


(a) 


(b) 


We use the Rational Root Theorem to hunt for integer roots by testing divisors of 18 to see if they are roots 
of f(x). We find that f(3) = 0 and 
f(x) = (x - 3)(x? +x + 6). 


We can’t factor x? + x + 6 easily, so we turn to the quadratic formula to find that the roots of x? + x + 6 are 


_ -14iVv23 
=— 


—1 + į V23 
= 


So, the three roots of f(x) are 3 and 
We start by searching for rational roots. We find that x + 2 divides g(x) evenly and gives us 
g(x) = (x + 2)(2x° — 12x? + 21x — 20). 


We continue by searching for roots of 2x? — 12x? + 21x — 20. Again using the Rational Root Theorem to 
restrict our search, we find that x — 4 is a factor. Dividing 2x? — 12x* + 21x — 20 by x — 4 gives us 


g(x) = (x + 2)(x — 4)(2x? — 4x + 5). 
As in part (a), we tackle the quadratic with the quadratic formula and find that its roots are 


= 4 + 2i V6 = %6; 
4 2 


So, the four roots of g(x) are —2, 4, 1 — 1 and 1 + m 


211 


CHAPTER 6. BASICS OF COMPLEX NUMBERS 


For both polynomials in Problem 6.25, the nonreal roots come in conjugate pairs. In other words, if a + biis a 
root, then so is a—bi. From the quadratic formula, we can see that nonreal roots of a quadratic with real coefficients 
must form a conjugate pair. Let’s see if we can prove a similar statement for higher degree polynomials. 


Problem 6.26: Let p(x) = anx" + a„-1x"7! +--+ +a 1x + a9 and q(x) = amx” be polynomials with real coefficients. 


| (a) Show that q(z) = q(z) for all complex numbers z. 


(b) Show that p(z) = p(z) for all complex numbers z. 
(c) Show that if p(z) = 0, then p(z) = 0. 


Solution for Problem 6.26: 


(a) In Problem 6.15, we showed that for all complex numbers w and z, we have Wz = W-Z. Using this 
relationship, along with the fact that am is real, we see that 


AmZ™ = Am Z" = Amz". 


Now, we note that 


which gives us q(Z) = AmZ™ = Amz" = Amz" = a(2). 
(b) In Problem 6.15, we also showed that for all complex numbers w and z, we have w +z = w +Z. Combining 
this with part (a), we have 
p(z) = AZ” + Ay Zz"! +--+ +A1Z + ag 
= AyZ" + An-12"7! + -+ + AZ + M9 


Tb. +Z + ag 


= AZ" era, 42 
= p(2), 
as desired. 
(c) From part (b), we have p(Z) = p2). So, if p(z) = 0, then we have p(z) = pe) =0= 0, 


O 


Important: Let p(x) bea polynomial with real coefficients. If z is a root of p(x), then Z is also 


a root of p(x). In other words, the nonreal roots of p(x) come in conjugate pairs. 


We can sometimes use this fact to build a polynomial given some, but not all, of its roots. 


Problem 6.27: Find the monic polynomial f(x) of minimal degree such that f(2 — i) = f(1 + 4i) = 0 and all 
| the coefficients of f(x) are real. (The degree of a polynomial a,x" + a,-1x""! + --- + ao, with a, # 0, is n, and 
the polynomial is called monic if a, = 1. For example, x° + 3x? + 7 is a monic polynomial with degree 6, and 
3x° — x + 1 is a degree 5 polynomial that is not monic.) 


Solution for Problem 6.27: Since 2 — iis a root of f and the coefficients of f are real, 2 + 1 must also be a root of f. 
Therefore, the quadratic with roots 2 — i and 2 + i must be a factor of f. Here are two ways to find this quadratic: 


Method 1: Multiply the linear factors. Since 2 — i and 2 + i are roots, the desired quadratic is 
(x —(2-1))(x-(2+1)) = x - (2 + ix — (2 — i)x + (2 - (2 + i 
=x% -4x +5. 


212 


6.4. NONREAL ROOTS OF POLYNOMIALS 


Method 2: Sum and product of roots. The sum of the roots of ax? + bx + c = 0 is —b/a and the product of the roots 
is c/a. Our work in Method 1 above shows why this result is true when a = 1. Specifically, we can write a monic 
quadratic with roots r and s as (x — r)(x — s). Expanding this product gives x? — (r + s)x + rs, so the coefficient of the 
linear term is the opposite of the sum of the roots, and the constant term is the product of the roots. Since our roots 
in this case are 2 — i and 2 + i, the sum is 4 and the product is 5, which means the desired quadratic is A —4x +5. 


Similarly, because 1 + 4i is a root of f, so is 1 — 4i. The sum of these roots is 2 and their product is 17, so 1 + 4i 
and 1 — 4i are the roots of the quadratic x? — 2x + 17. 


Since f must be divisible by both x? — 4x + 5 and x? — 2x + 17, and these two quadratics have no linear factors 
in common, the product of these two monic quadratics produces the desired monic polynomial f with smallest 
possible degree: 

f(x) = (7 — 4x + 5)(x* — 2x + 17) = x4 — 6x7 + 3007 — 78x + 85. 


Oo 


Problem 6.28: Suppose a + bi is a solution of the polynomial equation 


caz* + ic3z? + CZ? + icz + Co = 0, 


| where Co, C1, C2, C3, C4, A, and b are real constants. Prove that —a + bi is also a solution. (Source: AHSME) 


Solution for Problem 6.28: Let 9(z) = c4z* + ic3z* + coz” + icyz + co. The coefficients of g(z) are not all real, so we can’t 
deduce that the nonreal roots come in conjugate pairs. We don’t know much about roots of polynomials with 
nonreal coefficients, so we'd like to relate g(z) to a polynomial in which all the coefficients are real. If we had such 
a polynomial, we'd at least know that its nonreal roots come in conjugate pairs. 


We notice that the imaginary coefficients are in the terms with odd powers of z, so if we let z = xi, the resulting 
coefficients of x are real: 
g(xi) = Cie eae =e — CX een: 


Aha! Now, we've related g(z) to a polynomial with real coefficients. We let 
l f(x) = g(xi) = Ge a Ox — CX + cy 


and investigate the roots of f(x). We are given that g(a + bi) = 0. Because f(x) = g(xi), we therefore know that f(x) 
is 0 when xi = a + bi. Solving for x gives us x = (a + bi)/i = b — ai, and we have 


f(b — ai) = g((b — ai)i) = g(a + bi) = 0. 


Since f(b — ai) = 0 and f(x) has real coefficients, we know that b — ai = b + ai is also a root of f(x). Now, because 
g(xi) = f(x) = 0 when x = b + ai, we see that g(z) = 0 when z = xi = (b + ai)i = -a + bi, as desired. D 


6.4.1 Find all roots of g(x) = 2x3 + 5x? + 15x + 18. 
6.4.2 Find all roots of f(t) = 2t — 23f + 27t — 36. 


6.4.3 Let g(x) be a polynomial with real coefficients such that g(2 + i) = g(3 + 7) = 0, and deg g < 4. Other than 
2+iand 3 +i, find all possible roots of g. 


6.4.4 Prove that a fourth degree polynomial with a real root and real coefficients must have another real root 
(that may be the same as the first real root). 
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6.4.5x A polynomial of degree four with leading coefficient 1 and integer coefficients has two real zeros, both of 


which are integers. Which of the following can also be a zero of the polynomial? 
1+ivll 1+i 1 i 1+iv13 
= l r eee 
2 2 2 2 
(Source: AMC 12) Hints: 2 


6.4.6% Find all values of x such that xt + 5x? + 4x +5 = 0. Hints: 262 


6.5 Summary 


A complex number is a number of the form a + bi, where a and b are real numbers and 7 = —1. We call a the real 
part of a + bi and we call b the imaginary part. If b + 0, then a + bi is nonreal. If a = 0, then the complex number 
a + bi is called an imaginary number. Addition and multiplication of complex numbers are defined as follows: 


(a + bi) + (c + di) = (a + c) + (b + di, 
(a + bi)(c + di) = (ac — bd) + (ad + boi. 


Important: For any complex numbers v, w, and z, we have: 


ew+z=z+wand wz = Zw 
e v+ (w +2) = (v + w) + zand (vw)z = v(wz) 


è v(w +z) = VW + Uz 


| Important: The powers of i repeat in cycles of 4: | 
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If z =a + bi, then a — bi is called the conjugate of z, denoted Z. So, we have a + bi = a — bi. Together, a nonreal 
number and its conjugate are referred to as a conjugate pair. The product of a complex number and its conjugate 


is a real number. For any complex numbers z and w, we have zZ + W = Z + W and ZW =Z-W. 


Important: 


the conjugate of the denominator, Z: 


_ Wz 


Z2 


l 
ni| NI 


If w and z are complex numbers and z + 0, then we express the quotient w/z 
as a complex number by multiplying both the numerator and denominator by 


Since 2Z is always real, we can express wZ/(zZ) as a complex number, which 
means we can write it in the form a + bi for some pair of real numbers a and b. | 


A complex number can be represented as a point in the complex plane. Like the Cartesian plane, the complex 
plane has two axes: a horizontal real axis for the real part, and a vertical imaginary axis for the imaginary part. 
The real axis of the complex plane is labeled Re and the imaginary axis is labeled Im. Plotting the complex number 


x + yi on the complex plane is the same as plotting the point (x, y) in the Cartesian plane. 
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6.5. SUMMARY 


The magnitude of z, denoted by |z|, equals the distance from z to the origin on 
the complex plane. If z = a + bi, we have 


|z| = Va? + b? = Vz-Z. 


Important: We often use e Re(z) to refer to the real part of the complex — Z and we 
use Im(z) to refer to the imaginary part of z. So, for example Re(2 — 4i) = 2 and 
| Im(2 — 4i) = —4. For any complex number z, we have 


Re(z) = a and = = = 
ATOT aT namnaman = 2i = j 


The Fundamental Theorem of Algebra states that every one-variable polynomial with complex coefficients 
has at least one complex root. 


Important: Every polynomial of degree n has exactly n complex roots. (A number may 
DA appear multiple times among the roots of the polynomial. For example, the 
polynomial (x — 1)(x — 2)? has the three complex roots 1, 2, and 2.) 


| Important: Let p(x) be a polynomial with real coefficients. If z is a root of p(x), then Z is also 
| VY) a root of p(x). In other words, the nonreal roots of p(x) come in conjugate pairs. 


Things To Watch Out For! 


OE a 
| WARNING!! We have to be careful about using notation like V-81. The real-valued 
| C function f(x) = yx is only defined for x > 0. When we write V-81 as an 
= —81, we should keep in 


| intermediate step to go an equation like z7 


| _ mind that hat we mean 1 iV81 | CSN 


Problem Solving Strategies 


A A a a E a a a a NER NT a 
Concepts: e If you are given a problem involving a general complex number z, it may 
O= help to write z in the form a + bi, and then express the problem in terms of a 


and b. 


e It’s sometimes easier to work with the square of the magnitude of a complex 
number than with the magnitude itself, in part because we can then apply 
the relationship |z|}? = zz. 


e Trying specific examples can be a good way to discover general relation- 
ships. 


e Many complex number concepts have a strong geometric significance as 
well as an algebraic one, so if you have trouble understanding an algebraic 
relationship regarding complex numbers, try considering it in the complex 
plane. 
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i 


6.29 Letw=2+3i and z = 4 — 5i. Express each of the following as a complex number: 


REVIEW PROBLEMS 


(a) 2w-3z (c) = a (e) =! ae 
w + 2w?z + wz? (1 -i)* 
(b) w wz + wz (f) (1 + i)9 


6.30 Solve the equation -3 — x* = 9 + x’. 


6.31 Express in the form a + bi, where a and b are real numbers. 


Lae : 
a 
1+i 
6.32 Find the complex number z such that 1 — =-1+:;. 
6.33 Four complex numbers are plotted in the plane, as shown. One of them is z. The other Im 
three are —z, Z, and —Z. Which is which? z 


6.34 Let w and z be complex numbers. R 
e 


(a) Show that wz + wz is real. 
(b) Show that wz — wz is imaginary. 
6.35 Find two complex numbers whose squares equal 5 — 121. 


6.36 Find all real numbers c such that 7 +7 is 5 units from 10 + ci on the complex plane. 


6.37 Find the area of the region enclosed by the graph of |z — 4 + 5i] = 2V3. 


6.38 The diagram to the right shows several numbers in the complex plane. The circle Im 
has radius 1 and is centered at the origin. One of these numbers is the reciprocal of F. 
Which one? (Source: AHSME) 


6.39 Find all complex numbers z such that |z + 1 — i| = |z —2|. 


6.40 The product of the complex numbers a + bi and c + di is real, where a, b, c, and 
d are real numbers. Prove that the product of the complex numbers b+ ai and d +ciis p° 
also real. 


6.41 Describe the graph of each of the following: 
(a) 2-2 =—8) (b) (4-1)z- (4402 = 161 (c) |7+i-2z|=4 
6.42 Graph |z- 5 + 2i| < 4 on the complex plane. 
6.43 Simplify (i + 1)? — (i — 1). 
6.44 Show that |z — 1|? + |z + 1]* = 4 for all z such that |z| = 1. 
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CHALLENGE PROBLEMS 


6.45 
(a) Prove that for any complex numbers w and z, we have 2(|wi* + |z|?) = [w — zi? + |w + zi. 
(b) The relationship in part (a) is sometimes referred to as the parallelogram law. Why? 
6.46 One of the roots of x* + 9x° + 48x? + 78x — 136 = 0 is —3 + 5i. Find the other three roots. 


6.47 Let f(x) be a polynomial with degree 6 and rational coefficients. The polynomial has five distinct roots: 1, 
2-1,2+i11-i,and1 +i. 


(a) Wecalla value that appears twice among the roots of a polynomial a double root of the polynomial. Explain 
why f has a double root. 


(b) Which root is the double root? 


6.48 Let 7r, = a + bi and rz = a — bi, where a and b are nonzero real numbers. Find the minimum degree of a 
polynomial with real coefficients and roots ri and rzi. 


Challenge Problems > 


6.49 Evaluate i + 2i? + 37° + 4i4 +---+ 641%. 
6.50 Leta, b, and c be complex numbers. Suppose we want to solve the equation az + bz = c for z. 


(a) Show that if |a| + |b|, then the equation has a unique solution z. 
(b) Describe the solutions when a = —1 + 3i, b = 1 + 3i, and c = i. 
6.51 Find all complex numbers z such that |z — 1| = |z + 3| = |z — i]. (Source: ARML) Hints: 121 


6.52 Find the area of the region of points z in the complex plane such that 


iz+4-4i)<4V2 and |z-4-4iļ > 4V2. 


6.53 Find all roots of the polynomial iy® — 8y? — 22iy + 21. 


6.54 Show that the points w, x, y, and z are the vertices of a parallelogram in the complex plane if and only if the 
sum of some two of them is equal to the sum of the other two. Hints: 164 


4uv . 
6.55x Letu and v be complex numbers such that |u| = |v| = 1, u + v, and u # —v. Show that Ue is real, and 
+v 
that 


- is imaginary. Hints: 102 


6.56x Let (x, y) bea pair of real numbers satisfying 


56x + 33y = — and 33x— Boy = 5 


y 
x? + y? +y 
Determine the value of |x| + |y|. Hints: 94 


6.57x Suppose we define a sequence of complex numbers by zı = 0 and zn = z2 +i forn = 1. How far away 
from the origin is 2111? (Source: AHSME) Hints: 206 


6.58% Show that if wı + wz + ws = 0 and |wy| = |we| = [ws = 1, then w? + w3 + w3 = 0. Hints: 11, 69, 60 


a ee 
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6.59x A function f is defined on the complex numbers by f(z) = (a + bi)z, where a and b are positive numbers. 
This function has the property that the image of each point in the complex plane is equidistant from that point 
and the origin. Given that |a + bi] = 8, find the value of b?. (Source: AIME) Hints: 143, 125 


6.60 For all nonzero complex numbers z, let f(z) = 1/2. 


(a) Show that f(f(z)) = z for all z + 0. 


(b)x Asz varies along the line (1 + 2i)z — (1 — 2i)z = i, what curve does f(z) trace? 
Hints: 268 


an S 8a — b 


6.61x Find all ordered pairs (a, b) such that a + aR 2o 


= 0. Hints: 226 
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A complex system that works is invariably found to have evolved from a simple system that worked. — John Gall 


CHAPTER ss | 


Trigonometry and Complex Numbers 


7.1 Polar Form of Complex Numbers 


In Chapter 5, we learned how to convert rectangular coordinates (x, y) into polar coordinates (r, 0). In this section, 
we apply this conversion technique to complex numbers to write complex numbers in polar form. 


= > 


Problem 7.1: Explain why every complex number can be written in the form 7(cos 0 + i sin @) for some constants 
rand @ with r > 0. We call this form the polar form of a complex number. 


Problem 7.2: Suppose z = r(cos 0 + isin 0), where r > 0. What is |z| in terms of r and 0? 


Problem 7.3: Write each of the following in polar form by finding the positive value of r and the value of @ with 
0 < 0 < 27 such that the given complex number equals r(cos 8 + isin 0). 


(a) -6 
(b) 444i 
(c) 3-iv3 


(d) -v2+iv6 


Problem 7.4: Let w = 3 + i V3 and z = 2 V3 - 6i. 
(a) Find wz. 
(b) Express w, z, and wz in polar form. Notice anything interesting? 
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Problem 7.5: Let w = r(cos a@ + isin q) and z = s(cosf + isin f). Express wz in polar form. 


Problem 7.6: 
(a) Prove that (cos 0 + isin@)" = cos nO +isinné for any positive integer n and any angle 0. 


(b) Prove that (cos @ — isin 0)" = cosn@ ~ isin n0 for any positive integer n and any angle @. 


Problem 7.7: Express cos 50 in terms of cos 0. 


Problem 7.1: Explain why every complex number can be written in the form r(cos@ + isin 0) for some 


constants r and 0 with r > 0. We call this form the polar form of a complex number. 


Solution for Problem 7.1: Let z be the complex number x + yi, where x and y are real numbers. In Section 5.2, we 
learned that any point in the Cartesian plane with rectangular coordinates (x, y) can also be denoted with the polar 
coordinates (r, 0), where x = r cos 0 and y = r sin 0, with r > 0 and 0 < 0 < 27. Therefore, for any ordered pair of 
real numbers (x, y), there is a constant r and an angle 0 such that x = rcos 0 and y = rsin 0. This means that we 
can write the complex number x + yi in the form r(cos 0 + isin 0) for some constants r and @ such that r > 0 and 
0<0<27. 0 


The expression cos 0 + isin @ is sometimes shortened to cis 0. We’ll use this notation sparingly in this text, 
because in the next section we will introduce a better way to express complex numbers in terms of r and 0. 


Because the complex number z = x+ yi in the complex plane corresponds to rectangular coordinates (x, y) in the 
Cartesian plane, we call x+ yi the rectangular form of the complex number z. Similarly, because z = r(cos 0 +7 sin 0) 
in the complex plane corresponds to the point in the Cartesian plane with polar coordinates (r, @), we call this the 
polar form of z. We call r the magnitude of the complex number and @ the number’s argument. 


Don’t we already have something called the magnitude of a complex number? 


Problem 7.2: Supposez = r(cos 0 + isin 0), where r > 0. What is |z| in terms of r and 6? 


Solution for Problem 7.2: Solution 1: Algebra. We have 


Iz] = |rcos @ + ir sin 6| = 1/(rcos 0}? + (rsin 8)? = y r?(cos? 0 + sin? 8) = Vr =r. 


(The step Vr? = r is only valid because r > 0.) 


Solution 2: Geometry. As we saw in Problem 7.1, the point r(cos 0 + i sin 0) in the complex plane corresponds to 
the point (r, 0) in the Cartesian plane. Since |z| equals the distance from z to the origin in the complex plane, and 
the distance from (r, 0) to the origin in the Cartesian plane is r (when r > 0), we have |z| = r. O 


with 0 < 0 < 27 such that the given complex number equals r(cos 0 + isin 0). 
(a) -6 
(b) 4+4i 
(c) 3-iv3 
-V2 +ivV6 
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Solution for Problem 7.3: 


mportant: Converting complex numbers in rectangular form to polar form is exactly the 
same as converting the rectangular coordinates (x, y) to polar coordinates (r, @). | 


I 


(a) The point —6 in the complex plane is 6 units to the left of the origin. In the Cartesian plane, the polar 
coordinates of the point 6 units to the left of the origin are (6,7). Therefore, a polar form of the complex 
number —6 is 6(cos n + isin 7), or 6 cis 7. 


(b) The point 4+ 47 in the complex plane is point P in the diagram at right. SincePA=AO=4, Y 
we know that APAO is an isosceles right triangle. Therefore, we have PO = 4 V2 and 
ZPOA = T which means that a polar form of 4 + 4i is 4 V2 2 (cos 4 7 ising z), or 4 V2 cis Z a 


O A 


(c) The point 3 — i V3 in the complex plane is point P in the diagram at right. We have QO A 


= VPA? + AO? = 2 V3, so the hypotenuse of APAO is double the smaller leg of APAO. Š 


Therefore, we have 4POA = %. Since P is % clockwise from the positive x-axis, it is P 


6 
Uz radians counterclockwise from the x-axis. So, we can write 3 — i V3 in polar form as 


25 (cos #4 + isin Hx), or 2-V3cis HZ. 4 


(d) os point — V2 +i V6 in the complex plane is point P in the diagram at right. We have y 
= VPA? + AO? = 2 v2, a the hypotenuse of oe is double the smaller leg of APAO. ee 
THe we have ZPOA = 4, which means that P is 24 radians p OS a the 


positive x-axis. So, the polar a of — V2 +i V6 is 2 Nz 2 (cos 28 T iam F z), or 2 V2 cis 22 


Sidenote: You may have noticed that while we usually write complex numbers in the form 
a + bi, with the i after the imaginary part of the number, in the problem above we 
have written 3 — í V3 and — V2 + i V6 instead of 3 — V3i and — V2 + V6i. We follow 
this convention in part to avoid confusion, since we might mistake the i in 3 — V3i 
as being inside the square root symbol. (This mistake is particularly easy to make 


when the numbers i in question ar are hand-written. ) 


| Problem 7.4: Letw=3+ i V3 and z = 2 V3 - 6i. 
(a) Find wz. 


(b) Express w, z, and wz in polar form. Notice anything interesting? 


Solution for Problem 7.4: 
(a) Wehave 


wz = (3 + i V3)(2 V3 — 6i) = 6 V3 — 18i + 6i + 6 V3 = 12 V3 - 12i. 


(b) We could take a geometric approach, as we did in Problem 7.3. Or, we can note by the forms of w, z, and 
wz that the arguments of these complex numbers are multiples of 7. We can determine the corresponding 
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arguments by first factoring out the magnitudes of each, and then using our knowledge of trigonometry to 
find the arguments. Since |w| = |3 + i ¥3| = 2 V3, we factor 2 V3 out from w to find 


w=3+iv3= 298 (+53) 


The argument of w then is the angle @ such that cosO0 = = and sin = 4, so we have 0 = # and 


w =2 V3 (cos Ẹ +isin Z). 


Similarly, |z| = |2 y3 — 6il = V12 + 36 = = 43, and we have z = 2V3- 6i = 43 (3 — A This means 
the argument of z is the angle 0 such that cos @ = 4 and sin = = , which gives us 0 = °°. So, we have 


j= 4V3 (cos $ +isin $). 
Finally, we have |wz| = |12 V3 — 12i| = V432 + 144 = 24, and wz = 24 ($ -35 li). Therefore, we seek 0 


such that cos 0 = ¥3 and sin @ = —4, which gives us 0 = 2" and wz = 24 (cos 47 uz 4 isin u), 


Listing all of our results, we have 


T 
w=2V3 (cos © + isin ©), 


z =4V35 (cos +isin =), 


wz = 24 (cos == +isin Z5), 


Interesting—the magnitude of wz is the product of the magnitudes of w and z, and the argument of wz is 
the sum of the arguments of w and z. 


Is our observation at the end of this solution true for any two complex numbers w and z? 


Problem 7.5: Let w = r(cosa + isin a) and z = s(cosf + isin). Express wz in polar form. 


Solution for Problem 7.5: We have 


wz = r(cosa@ + isin a) - s(cosf + isin f) 
= rs-(cosa@ + isin a)(cos ß + isin $) 
= rs -(cosacosf — sin æ sin f + isinacosf + i cosa sin f). 
We recognize the real part of the expression in parentheses as cos(@ + ß) and the imaginary part as sin(a + ß), so 


we have 
wz = rs (cos(a@ + p) + isin(a + f)) . 


For any two complex numbers w and z, the magnitude of wz is the product of 
the magnitudes of w and z, and the argument of wz is the sum of the arguments 
of w and z (or a multiple of 27 less than the sum of the arguments of w and z, if 
we restrict arguments to being in the interval [0, 27)). 
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Problem 7.6: 
(a) Prove that (cos 0 + isin @)" = cos n@ + isin n8 for any positive integer n and any angle 0. 


(b) Prove that (cos 0 — isin @)" = cosn@ —isinn@ for any positive integer n and any angle 0. 


Solution for Problem 7.6: 


(a) We have already done the heavy lifting for this problem when we solved Problem 7.5. There, we showed 
that (cis a)(cis $) = cis(a + 6). So, we have 


(cos 0 + isin 0}? = (cos 6 + isin @)(cos 0 + isin 0) = cos(0 + 0) + isin(0 + 0) = cos 20 + isin 20. 
We multiply this result by cos 0 + isin 6 to find an expression for (cos 0 + isin 0}: 
(cos @ + isin 0} = (cos @ + isin @)(cos 0 + isin 0) 
= (cos 0 + isin 8)(cos 20 + isin 20) 


= cos(9 + 20) + isin(O + 20) 
= cos 30 + isin 30. 


We use induction to extend this argument to any integer n. Clearly, we have (cos 0+i sin 0)” = cosn0+isin nO 
for n = 1. Assuming that (cos 0 + isin 6)‘ = cosk@ + isink@ for some positive integer k, we have 


(cos 0 + isin @)**! = (cos @ + i sin )(cos 0 + isin 6) 
= (cos 0 + isin 0)(cos k0 + isin k0) 
= cos(0 + k@) + isin(@ + k6) 
= cos((k + 1)0) + i sin((k + 1)0). 


Therefore, by induction, we have (cos 6 + isin 8)" = cos nO + isin n0 for all positive integers n. 


(b) The complex numbers in this part are not in polar form, but they are the conjugates of the complex numbers 
in part (a). This observation gives us a quick solution. If two complex numbers are equal, their conjugates 
must be equal. So, taking the conjugate of both sides of 


(cos 0 + isin 0)” = cosn@ +isinn6@ 


gives 
(cos 6 + isin 0)" = cosn@ + isinn@ = cosn@ — isin nð. 


Since z” = (Z)", we have 
(cos Ô + isin 6)" = (cos 0 + isin 0)” = (cos 0 — isin 6)", 


so we have the desired (cos 0 — isin 0)" = cosn@ —isinné. 


The relationship we proved in part (a) is commonly referred to as de Moivre’s Theorem. 


‘Important: De Moivre’s Theorem states that _ TE - r; nf ? 


(cos 0 + isin 0)” = cosnO + isin nð 


for any @ and any positive integer n. — 


NN ee 
- 223 


CHAPTER 7. TRIGONOMETRY AND COMPLEX NUMBERS 


This relationship is historically important because it helped pave the way for a deeper understanding of 
complex numbers that we explore in the next section. Because de Moivre’s Theorem is a straightforward result of 
the concepts we'll develop in the next section, you don’t need to memorize it now. 


| Problem 7.7: Express cos 50 in terms of cos 0. j 


Solution for Problem 7.7: We could start by writing cos 50 = cos(40 + @) and using the angle sum identity, but that 
would give us an expression that includes cos 40 and sin 40. We could then write 40 as 30 + 0 and apply angle 
sum identities again, but that will create quite a mess. Let’s see if we can find a better approach. 


As a starting point, we seek an equation that involves both cos 50 and cos 0. De Moivre’s Theorem gives us 
one: 
cos 56 + isin50@ = (cos @ + isin 0Y. 


Now, we just have to expand the right side. We can use the Binomial Theorem or a bit of algebra to see that 
(x+y)? = 2° + Sxty + 1029 y* + 10x7Y? + 5xy* + y. 
We expand (cos 6 + isin 0} by letting x = cos 0 and y = isin @ in our expansion of (x + y)°. This gives us 
cos 50 + isin5@ = cos’ 0 + 5icos* @ sin 0 — 10 cos? @ sin? O — 10i cos? O sin? 0 + 5cos O sin‘ 0 + isin? 0. 


The real part of the left must equal the real part of the right. Combining this with the fact that sin? 0 = 1 — cos? 6 
gives us 
cos 50 = cos’ 6 — 10 cos’ @ sin? 9 + 5cos 0 sint @ 
= cos’ @ — 10 cos? O(1 — cos? 0) + 5 cos (1 — cos? 6)" 
= cos’ 6 — 10 cos? 6 + 10 cos? @ + 5cos O(1 — 2 cos? 0 + cos* 0) 
= cos’ 6 — 10 cos? 6 + 10 cos? O + 5cos 0 — 10 cos? O + 5 cos? 0 
= 16 cos 0 — 20 cos? 0 + 5 cos 0. 


Concept: De Moivre’s Theorem can help us tackle problems involving cosn0 or sin n0, 


Q= where n is a positive integer greater than 1. 


| Sidenote: Above, we showed that 


cos 50 = 16 cos? 0 — 20 cos? 6 + 5 cos 0. 


So, if we let f(x) = 16x — 20x° + 5x, then cos50 = f(cos@). This polynomial 
f(x) is one of a family of polynomials called the Chebyshev polynomials of the 
first kind. These polynomials are typically denoted T,,, and satisfy the definition 
To(x) el) = x, and 

Troi = 2xT p(x) — Tn-1 (x). 
You'll havea chance to explore Chebyshev polynomials in Challenge Problem 7.69, 


where you'll show that T,,(cos 0) = cos n@ for any positive integer n and any angle 
0. (In some sources, “Chebyshev” is given the alternative spelling “Tchebychev,” 


which may explain why we typically use “T” for the Chebyshev polynomials.) 
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[Exercises 


7.1.1 Write each of the following in polar form: 


(a) ees 
(be 2; 
(c) 2V3+6i 
Gi 
7.1.2 Write each of the following in rectangular form: 
(a) 6cis150° O aae 
(b)  8cis3780° (d) 9cis sn 


7.1.3 Show thatif w = r cis a and z = s cis $ (and z + 0), then ¥ = £ cis(a — p). 
7.1.4 Evaluate each of the following in your head without using paper, pencil, pen, or a calculator. 
(a) (6 V3 + 18i)(—2 V3 + 6i) (b) (4+ 41)(-3 - 3i)(-10 + 103) 


7.1.5 For what values of 0 does cis 0 = cis(—0)? 
7.1.6 ‘Test the identity we found in Problem 7.7 for cos 50 by letting 0 = 30°. 
7.1.7 Express cos 70 in terms of cos @. 


7.1.8x For how many positive integers n less than or equal to 1000 is 
(sint + icos t)” = sinnt +icosnt 
true for all real t? (Source: AIME) Hints: 52 


7.2 Exponential Form of Complex Numbers 


In the last section, we saw that the argument of a product of complex numbers is the sum of the arguments of the 
complex numbers. This observation inspires another notation for complex numbers. 


tl 
Problem 7.8: Let f(x) = cosx +isinx. 


(a) Show that f(0) = 1 and f(x)f(y) = f(x + y) for all real x and y. 


(b) Suppose a function g satisfies g(0) = 1 and g(x)g(y) = g(x + y) for all real x and y. Show that it also satisfies 
the following: 


Problems 


(i) g(x) #0 for all x. 
(ii) g(x— y) = g(x)/g(y) for all real numbers x and y. 


(iii) [g(x)]" = g(nx) for any real number x and any positive integer n. 
(iv) g(-x) = 1/g(x). 
(c) What other functions g have g(0) = 1 and satisfy the identity ¢(x)g(y) = g(x + y) for all x and y? 
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Problem 7.8: Let f(x) = cosx +isinx. i one. 
(a) Show that f(0) = 1 and f(x) f(y) = f(x + y) for all real x and y. 


(b) Suppose a function g satisfies ¢(0) = 1 and g(x)g(y) = g(x + y) for all real x and y. Show that it also 
satisfies the following: 


(i) g(x) + 0 for all x. 
(ii) g(x- y) = g(x)/g(y) for all real numbers x and y. 


(iii) [e(x)]" = g(nx) for any real number x and any positive integer n. 
(iv) 9(—x) = 1/g(x). 
(c) What other functions g have g(0) = 1 and satisfy the identity 9(x)g(y) = 


g(x + y) for all x and y? 


Solution for Problem 7.8: 


(a) Wehave f(0) = cos0+isin0 = 1 + 0i = 1. Proving that f(x) f(y) = f(x + y) is essentially a simplified version 
of Problem 7.5: 


F(X fly) = (cos x + isin x)(cos y + isin y) 
= cos xcos y — sin xsin y + i(sinx cos y + cos x sin y) 
= cos(x + y) + isin(x + y) 
= f(x+y). 


(b) Letting y = —x in g(x)g(y) = g(x + y) gives g(x)e(—x) = g(0) = 1 for all x. Therefore, we cannot have g(x) = 0 
for any x, since 9(x)g(—x) cannot be 0. 
Since g(x)e(y) = g(x + y), dividing both sides by g(y) gives (x) = g(x + y)/g(y). Letting x + y = z, so that 
x =Z — y, gives us 9(z — y) = 2(z)/g(y), which proves (ii). 
The identity [¢(x)]” = g(nx) is simply a repeated application of g(x)g(y) = g(x + y). Specifically, we have 
Ke = g(x)e(x) = g(2x), and [9(x)P = [g()P9(x) = 2(2x)e(x) = g(2x + x) = 9(3x), and so on. We can 
formalize this with induction. We clearly have [g(x)]' = g(x), and if [2(x)]* = g(kx) for some positive integer 
k, then we have 
BO = gwl) = g(x)g(kx) = g(x + kx) = g((k + 1)x). 
This shows that if [g(x)]” = 9(nx) holds for n = k, then it holds for n = k + 1. We have already showed that it 
holds for n = 1, so by induction, it holds for all positive integers n. (Perhaps you recognize this argument 
from our proof of de Moivre’s Theorem.) 
Since 9(0) = 1 and 9(x)/g(y) = g(x — y), letting x = 0 gives us 1/g(y) = 9(—y), which proves identity (iv). 
(c) Exponential functions have this behavior! For example, if g(x) = 2”, we have g(0) = 2° = 1 and 


g(x)g(y) = 2 -2 = 209 = g(x + y). 


Notice that exponential functions also satisfy the identities described in part (b); these are examples of the 
laws of exponents with which you're already familiar. 


We see that the arguments of complex numbers appear to behave a lot like exponents. Specifically, we showed 
in Problem 7.8 that the function 
f(x) = cosx +isinx 


satisfies the equation 


F(x) Fly) = fx + y) 
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for all x and y. Moreover, we confirmed that f(0) = 1, and that f satisfies many of the relationships we know as 
the laws of exponents, such as f(x)/f(y) = f(x - y), f(—x) = 1/ f(x), and [f(x)]" = f(x) for all positive integers n. 


Because f(x) = cos x+isin x satisfies the same relationships that exponential functions satisfy, we have a special 
exponential notation for complex numbers. We write cos x + isin x as e”. 


Definition: For all real x, we define e” as 


e* = cosx +isinx. 


With this notation, we can write the identities we showed in Problem 7.8 as 
ex 3 eY = Be) 


e* Jel = PE 


z n s 
(e=) = e" 


elx 


Now you see why we use this notation—all of these relationships are easy to recall because they are the same 
as the laws of exponents. For example, if you see e /e? in a problem, it’s easy to recall that this equals e-), but 
if you see (cos æ + isina)/(cos f + isin f) in a problem, it may not be immediately obvious that this expression 
equals cos(a— f) +isin(a—). You might also have noticed that for positive integers n, the relationship (e”) ap 
is de Moivre’s Theorem. This is why we suggested not memorizing de Moivre’s Theorem after we proved it on 
page 223. Exponential form makes de Moivre’s Theorem easy to remember. 


Because cos x + isin x = e", we can write the polar form of a complex number z = r(cos 0 + isin 0) in the form 


We call this the exponential form of the complex number z, where r is the magnitude of z and @ is the argument, 
as with polar form. 


WARNING!! We always use radians in exponential notation, never degrees. 


4,6 


You'll have to wait until your study of calculus to see technical reasons for using radians rather than degrees, 
but our introduction of radians in Section 2.2 gives a qualitative explanation why we prefer radians. 


Sidenote: The symbol e is not just a placeholder to remind us that complex numbers can 
ay be written in an exponential form. There is a real number e such that we have 
£? = cos@+isin9. This real number is irrational and approximately equal to 
2.7182818. 
There are many ways to define e. One of the simplest is to consider the 


expression g(n) = (1 + 1)" Using a calculator or computer, compute g(n) for 
larger and larger values of n. As n gets very large, you'll find that g(n) appears to 
become constant. That constant is e. 

While you'll find e very useful in your study of complex numbers, you'll see 
many more wonderful properties of e in your continued study of mathematics, 
and you'll learn why it is considered one of the most important constants in 
mathematics. 
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(b) 1-i (c) -3V¥3+3i 


Problem 7.10: Express each of the following in rectangular form: 


(b) 6e™/3 (c) Ae /4 


Problem 7.11: Evaluate the following: 
(a) (+i) b) (~iv3) 


Problem 7.12: For what values of 8 is e°? = 1? t 


Problem 7.13: Show that if z = re’, then Z = re”, 


Problem 7.14: 
(a) Use exponential form to prove that zZ = |z|? for all complex numbers z. 


(b) Use exponential form to prove that |zz| = |z||w| for all complex numbers z and w. 


(c) Prove that complex number multiplication is associative. In other words, show that if w1, w2, and ws are 
complex numbers, then (w1w2)w3 = w (ww). 


Problem 7.15: Express e™ - e in terms of cos x, sin x, cos y, and sin y. How does your result make remembering 
the angle sum identities for cos(x + y) and sin(x + y) easy? 


Problem 7.16: 


(a) Find cos@ in terms m e? and e”. 


(b) Explain how you can use your relationship from part (a) to remember the product-to-sum identity for 
cos X COS y. 


Problem 7.17: Find the positive integer n such that 


arctan l + arctan 1 + arctan : + arctan l = 
3 4 5 n 4 


(Source: AIME) Hints: 74, 124 


£ Problem 7.9: Express each of the following in exponential form: = 
| 


| @) =D (b) 1-i (c) -3¥3+3i 


Solution for Problem 7.9: Converting rectangular form to exponential form is essentially the same as converting 
rectangular form to polar form. 


(a) We have |—2i| = 2. Since —21 ie - units below the origin in the complex plane, the argument of —2i in 
exponential (and polar) form is 3%. Therefore, we have —2i = 2¢°""/?. 
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(b) We have |1 — i] = V2, so we convert 1 — i to exponential form by ene 1-i= V2 (+ ve 2; 2i) and finding 
the angle 0 such that cos 0 = 2 and sin 0 = = . This angle is 2, so we have 1 — i = Por ue 


(c) We have |-3 V3 + 3il = 6, so we convert —3 V3 + 3i to exponent form n E -3 V3 +31 =6 (-¥ + i) 


PET 
and finding the angle 0 such that cos 0 = — 2 and sin @ = = 5- This anglei is =, so we have —3 V3 + 3i = 6207/6, 


Problem 7.10: Express each of the following in rectangular form: 


(a) em (b) 6e7™#/3 (c) 4e77#/4 


Solution for Problem 7.10: 


(a) We have e™ = cosn + isinn = —1 + 0i = —1. 


Sidenote: Adding 1 to both sides of e™ = —1 gives us an equation that relates the five most 
important constants in mathematics: 


| 


o e +1=0. 
(b) We have 6e"? = 6 (cos 3 + isin?) = 6 (} + Bi) =3 + 3i V3. 
(c) We have 4e”"'/4 = 4 (cos + isin) =4 (2 - Wj) = 2 V2 — 2i V2. 


Oo 


[Problem 7.11: Evaluate the following: J 
pa) +)” Gi Gai v2): 


Solution for Problem 7.11: 


(a) We could start with the Binomial Theorem to expand (1+i)”°... on second thought, maybe not. Exponential 


form offers a much easier way. In exponential form, 1 + i is ¥2e™/*, so we have 


(1 + i)? = (V2e7!/4)20 = (21/2)20,20nt/4 — 9105? — 1024(cos 57 + isin 57) = —1024. 
(b) Asin the previous part, we convert 3 — i V3 to exponential form, writing it as 2 ¥3e!!"/6. We therefore have 


(3 — i ¥3)° = (2 V3el!™/6)5 = (2 V3) e76 = 32. 9 V3 (cos =z +isin n) : 


Subtracting multiples of 27 repeatedly from 22" gives us 


(3 -i v3) = 288 VB (cos + isin E) = 28608 ( -32 - i) = —432 — 144i V3. 


Problem 7.12: For what values of 0 is ef = 1? 


Solution for Problem 7.12: We obviously have £? = 1 when 0 = 0, but are there other values of 0 for which e? = 1? 
Writing e°? in polar form gives us cos O + isin 0 = 1, which means that cos 0 = 1 and sin 0 = 0. Therefore, 0 must 
be an integer multiple of 27 in order to have Cae 
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We also could have reasoned to this answer by thinking about the complex plane. Plotting the point e’ ê in the 

complex plane is the same as plotting the point with polar coordinates (1, @) in the Cartesian plane. So, the point 

£? is on the unit circle graphed in the complex plane. Specifically, it is the terminal point of 8, which means we 
have e®° = 1 if and only if 0 is an integer multiple of 27. o 


‘Problem 7.13: Show that if z ae then Z = re`’, 


Solution for Problem 7.13: We have z = re!’ = r(cos 0 + isin 0), so 
Z=rcos0+risin@ =rcos@ —risin@ = r(cos @ — isin 0). 
Because cos 0 = cos(—@) and sin 0 = — sin(—@), we have 


Z = r(cos 0 — isin 0) = r(cos(—@) — i(— sin(—@))) = r(cos(—@) + isin(—@)) = fen 
-i0 


Important: If z = re®, then Z = re 


Problem 7.14: 


(a) Use exponential form to prove that zz = |z|? for all angle: numbers B 


(b) Use exponential form to prove that |zw| = |z|lwl for all complex numbers z and w. 


(c) Prove that complex number multiplication is associative. In other words, show that if w1, wz, and w3 
are complex numbers, then (w1w2)w3 = wı(wzw3). 


Solution for Problem 7.14: 


(a) Letz = re?, so |z| =r and Z = re™®. Therefore, we have 
By = e ee = = |p 


(b) Back on page 205, we bashed rough this proof with rectangular form, but exponential form gives the 
result immediately. Let z = re’? and w = se’. Then, |z| = r, |w| = s, and 


|zw| = |(re!)(se®)| = |rse*)| = |rs|le*| = rs = lew]: 


c) Let w, = ne, w = me, and w = r3e. Then, we have 
1 1 s W2 2 , 


i(01+02) | Kente 


(w1w)w3 = (rie! ne) rze = nne 17308 = nre 


w (wzw) = re (rae k rsei”) = rem (rargel(2*®)) = ieee) 


SO (W1W2)W3 = W1(W2W3). 


‘Concept: a Exponential form sometimes offers an easy way to prove facts involving complex | 


number multiplication. ' 
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Problem 7.15: Express e’*-e'Y in terms of cos x, sin x, cos y, and sin y. How does your result make remembering 
the angle sum identities for cos(x + y) and sin(x + y) easy? 


Solution for Problem 7.15: We have 
e™ -e = (cosx + isin x)(cos y + isin y) 
= cos x cos y — sin xsin y + i(sinx cos y + cos xsin y). 
However, we also have 
e™ eY = el”) = cos(x + y) + isin(x + y). 
Our two expressions for e” - eY must be equal: 
cos(x + y) + i sin(x + y) = cos xcos y — sin x sin y + i(sinx cos y + cos x sin y). 

Two complex numbers are equal if and only if their real parts are equal and their imaginary parts are equal. 
Therefore, the equation above gives us 

cos(x + y) = cos x cos y — sin xsin y, 

sin(x + y) = sin x cos y + cos xsin y. 
These are the familiar angle sum identities, and now we have a way to quickly remember them, rather than having 
to memorize them. O 


We don’t call our work above a “proof” of the angle sum identities because our explanation above depends 
upon the fact that the function f(x) = cosx + isinx behaves like an exponential function. However, we proved 
this behavior using the angle sum identities! So, using this behavior to “prove” the angle sum identities would 
be circular reasoning. (By “circular reasoning,” we mean using a statement we wish to prove as a step in proving 
the statement itself. This is clearly invalid!) 


| Problem 7.16: 
(a) Find cos @ in terms of e° and e 


-i8 


| (b) Explain how you can use your relationship from part (a) to remember the product-to-sum identity for 
| COS X COS y. 


Solution for Problem 7.16: 


(a) We have 
eÊ = cos@ + isin, 
e™® = cos(—0) + i sin(—0) = cos @ — i sin 0, 
where we have used the identities cos(—0) = cos 0 and sin(—@) = — sin 0 in our expression for a Adding 
our equations for et? and e~® eliminates isin 0 and gives e? + e`? = 2 cos 0, so 
ei 4 eid 
cos 0 = a ina 


As an Exercise, you'll derive a similar expression for sin 0. 


Important: For any angle 0, we have 


Y 
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(b) Using the identity from the previous part, we have 


eX +e N (ev + ely 
cos x COS y = 5 5 


= z CET ET 4 ei) 


E 1 eifxty) T ew ixty) Fi eilx-y) at, e7t(a-Y) 
2 2 i 


Again applying the identity from part (a), we see that the two fractions in parentheses above equal cos(x + y) 
and cos(x — y), and we have the product-to-sum identity for a product of cosines: 


cos x cos y = ; (cos(x + y) + cos(x — y)) . 


Concept: The exponential form of complex numbers is a helpful way to remember trigono- 
| metric identities. i 


Problem 7.17: Find the positive integer n such that | 


arctan l + arctan l + arctan I + arctan l ENA 
3 4 5 n 4 


Solution for Problem 7.17: We could tackle this problem by taking the tangent of both sides and using the angle 
sum identity for tangent repeatedly. But complex numbers offer a much nicer solution. On the left, we have a sum 
of angles. Where else have we seen a sum of angles? When we multiply complex numbers in polar or exponential 
form, the argument of the product is the sum of the arguments of the factors. So, we can view each term on the 
left as the argument of a complex number, and the sum on the left is the argument of the product of these complex 
numbers. 


But what complex number has argument arctan t? Comparing the rectangular form to the polar form quickly 


gives the answer, since x + yi = r(cos@ + isin 0) gives us x = rcos@ and y = rsin@. Dividing these gives us 
Z = Sinf = tan @. So, a complex number with tan 0 = t is x+ yi = 3 + li. Similarly, 4 + 1i has argument arctan ie 


5 + 1i has argument arctan 3, and n + 1i has argument arctan +. The argument of the product 


(34+1(44+05+i(n + i) 


is equal to the sum of the arguments of these four complex numbers. From the expressions we just found for the 
arguments of these four numbers, we see that the argument of this product is 


1 1 1 1 
arctan = + arctan — + arctan = + arctan —, 
3 4 5 n 
which we are given is equal to Ẹ. Since the argument of (3 + i)(4 + 1)(5 + i)(n + i) equals 7, and cos Ẹ = sin 4, the 
real and imaginary parts of the product must be equal positive numbers. Expanding the product gives 
(3 +i (4+ i)(5+ i(n +7) = (11+ 71)(5 + )(n +i) 
= (48 + 46i)(n + 7) 
= (48n — 46) + (48 + 46n)i. 
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We therefore have 48n — 46 = 48 + 46n, from which we find n = 47. 0 


7.2.1 Express each of the following in exponential form: 


(a) -32i @ 2422 

(b) -8+8i V3 (d) 215+ 2i-v5 
7.2.2 Express each of the following in rectangular form: 

(a) etri (c) eomi/3 

(b) eom/4 (d) e)7ni/3 


cif _ g-i? 
7.2.3 Prove that sin 0 = S 
i 
ex 
7.2.4 Reproduce the angle difference identities for sine and cosine by expressing T in rectangular form. 


7.2.5 Compute the following in your head. (Don’t use paper, pencil, pen, or a calculator.) 


8 + 8i -12 — 4i V3 


(a) -3 +31 ©) 3+ 3iv3 


7.2.6 Evaluate each of the following: 
oa (b) (2V3 e 
7.2.7 For what positive integers n is (—3 + i V3)" a real number? 


7.2.8 Show that e + ef = 2 cos ( zE) Ca 


7.3 Roots of Unity 


In this section, we focus on the roots of a special class of polynomials, those of the form 
fe) =x =i, 


where n is a positive integer. The roots of such a polynomial are called the n™ roots of unity. We call these the 
roots of unity because finding the roots of x” — 1 is equivalent to finding those numbers whose n' power is 1. 
Obviously, 1 is one of these roots, but if n > 1, then there are other roots of f, as well. We explore these other roots 
in this section because they appear in so many areas of mathematics. 


Problem 7.18: Find the roots of each of the following polynomials in rectangular and exponential forms. ee 
the roots of each on the complex plane. 


a) f@=x-1 (b)  g(x)=x*-1 (c) h(x)=xf-1 
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Problem 7.19: In this problem, we find the roots of f(x) = x° — 1. 


(a) How many roots does f have? Why is it difficult to find the roots of f using the techniques you used in 
the previous problem? 


(b) Lett = re’. If f(t) = 0, then what is r? 
(c) If f(£) = 0, then what are the possible values of £? 


Problem 7.20: Let n be a positive integer. What are the roots of f(x) = x" ~ 1? If we plot the roots in the complex 
plane, then they are the vertices of what type of figure? 


Problem 7.21: Suppose w is a root of f(z) = z" — 1. Which of the following must also be a root of f(z)? 


(a) @ (b) -1 


Problem 7.22: In this problem, we extend our understanding of the roots of unity to find the roots of other 
complex numbers. 


(a) At most how many different complex numbers have fourth powers equal to —8 — 8i V3? 


(b) Find all complex numbers whose fourth power is —8 — 8i V3. Plot all of them in the complex plane. 


How are your answers to part (b) related to the fourth roots of unity? 


Let x = re. How are the n™ roots of x related to the n™ roots of unity? 


Problem 7.23: An n root of unity, z, is called a primitive n™ root of unity if z" = 1 and z* # 1 for k = 


1,2,3,...,n — 1. For example, —1 is a primitive square root of unity, since (~1)* = 1, but (—1} # 1. 
(a) Find the primitive cube roots of unity. 


(b) Find the primitive fourth roots of unity. 


(c) Find the primitive sixth roots of unity. 


Let n be a positive integer greater than 1. Prove that e?”/” is a primitive n® root of unity. 


Problem 7.24: Is e&/5 a primitive fifth root of unity? 


Problem 7.25: Let w = e*""/", where k is a positive integer, be an n"" root of unity. 
(a) Show that if gcd(k,n) = 1, then w is a primitive n™ root of unity. 
(b) Show that if gcd(k,n) + 1, then w is not a primitive n™ root of unity. 
(c) Leta and b be positive eee such that 0 < b <a < n. Show that if w is a primitive n™ root of MDL, 


then it is impossible to have œ” = w’. Why does this show that the first n — 1 powers of w are the n — 1 n™ 
roots of unity besides 1? 


| Problem 7.18: Find the roots of each of the following polynomials in rectangular and exponential forms. Plot 
| the roots of each on the complex plane. 


Ha fee -1 
| b) g@)=x4-1 
| (c) h(x) = x° —1 
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Solution for Problem 7.18: 


(a) Factoring x? — 1 as a difference of cubes, we have f(x) = (x - 1)? +x+4+1). (We Im 
also could have noticed that f(1) = 0, so x — 1 is a factor of f.) Therefore, 1 
is a root of f, and so are the roots of the quadratic x? + x +1. Applying the 


quadratic formula gives us the roots x = zivs, So, the roots in rectangular form 


1 Bo 5 À 7 : 4 à 
ane l =z 5) -}- ip and in exponential form, they are e°, e?™/3 ,e*!/3, Since the Re 


magnitude of each root is 1, we can plot each by plotting the terminal point of the 
angle of the exponential form of the root. Connecting the three points produces 
an equilateral triangle. 


(b) Factoring x* — 1 as a difference of squares gives (x? — 1)(x? + 1). The roots of x? - 1 Im 
are 1 and —1, and the roots of x? + 1 are i and —i, so the roots of f(x) in rectangular 
form are 1,7,—1,—i. In exponential form, these roots are e°, e!/*,e", 87/2, Again, 
the magnitude of each root is 1, so we can plot each root by locating the terminal 
point of its angle in exponential form. As shown at right, plotting the roots on the 


complex plane produces the vertices of a square. SS 


(c) Factoring xí — 1 as a difference of squares gives (x? — 1)(x° + 1). Factoring each of these as a difference or 
sum of cubes gives us 


HOj=o == Na = 1l)=@—-lo +e oe =a 


So, 1 and —1 are roots of f, and applying the quadratic formula to the two quadratics tells us that the 
following are the roots of f in rectangular form: 


eee We VE a 1+ivV3 1-ivV3 


ike , , 1 , . 
2 2 2 2 
In exponential form, these are , l ma , 
e, gaa PTR an aie erils | 
Putting these in order by increasing argument (angle), we have Im 


e, ee EaR e, ey ers. 


Each argument is 7/3 more than the previous, and we aren’t surprised to find that 
these roots are evenly spaced about the unit circle. Connecting them in this order Re 
gives us a regular hexagon, as shown at right. : 


Oo 


[Problem 7.19: In this problem, we find the roots of f Gy=7 i 


| (a) How many roots does f have? Why is it difficult to find the roots of f using the techniques we used in 
| 


the previous problem? 
(b) Lett = re. If f(t) = 0, then what is r? 
(c) If f(t) = 0, then what are the possible values of t? 


Solution for Problem 7.19: 


(a) Because f has degree 5, it has 5 roots. We can’t use the difference of squares or difference of cubes 
factorizations, but we can see that f(1) = 0, so 1 is a root. Factoring out x — 1 gives 


f(x) = (x-1 +P $22 4041). 
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Unfortunately, now we're stuck. None of our typical root-finding techniques helps us with x4 +x? +x? +x+1. 
We'll have to try something different. 


(b) Having seen that all the roots of x? —1, x* —1, and x°—1 are of the form e”, we try to find roots in exponential 
form. We let t = ræ, where r is a nonnegative real number and t is a root of f. Since f(t) = 0, we have 


(re? r = 1, or Pe*? = 1, The magnitude of the right side is 1, so the magnitude of the left side must be 1. 
Since r is nonnegative, we have |728| = 1°, so now we have r° = 1, which means r = 1. (To see why there 
are no other nonnegative real numbers such that 7° = 1, note that 7° > 1ifr > 1andr <1if0<r<1)) 


(c) Since f(t) = 0 and t = ret? gave us Pe? = 1, and we found that r = 1, we must Im 
have e”? = 1. As we saw in Problem 7.12, if e”? = 1, then 50 must be an integer 
multiple of 2x. Therefore, we must have 50 = 27k for some integer k in order 
to have e° = 1. Dividing by 5 gives us 0 = 2mk/5, so the roots are of the form 


t = e’"*/5_ Letting k be each of 0, 1, 2,3, and 4 gives us the roots of f: Re 


el, pee ES one g8il5, 


Any other integer k just reproduces one of these values. For example, if k = 5, 
then 0 = 2nk/5 = 2r, which produces the same root as 6 = 0. Similarly, increasing k by 5 always increases 
0 by 27, which doesn’t change e"? because ef(9+2") = e8 . e274 = e8 .1 = e”, 

We aren't at all surprised to see that the roots of f are equally spaced around the unit circle, and that 
connecting them forms a regular pentagon. 


Problem 7.20: Letn bea positive integer. “What are the roots of f(x) = x"—1? If we plot the roots in the 
complex plane, then they are the vertices of what type of figure? 


Solution for Problem 7.20: The previous two problems give us a clear guide. We let t be a root of f(x), and write t 
in exponential form: t = re’, where r = 0. Since f(t) = 0, we must have (re’”)” = 1, so r"e"® = 1. This tells us that 
|r"e"9| = 1. Since r > 0, we have |r*ein?| = r", so |r"e”9| = 1 means that 7” = 1, which gives us r = 1. (Yes, this 
should all look familiar—it’s exactly the same reasoning we used to find the roots of x° — 1.) 


Since r = 1, the equation f(t) = 1 is now e”° = 1, which means we must have n@ = 27k for some integer k. 
Dividing by n gives us 0 = 2rk/n, so our roots are of the form t = Eons Letting k be each of 0,1,2,...,n—1 gives 


us the roots of f: 
0 ,2ni/n Anijn ,6ni/n 2(n-1)ni/n 
ee ie ee 


Notice that any other integer k just reproduces one of these values. For example, if k = n, then 0 = 2nk/n = 2r, 
which produces the same root as 0 = 0. Similarly, increasing k by n always increases 0 by 27, which doesn’t 
change e°. 


These n roots are equally spaced about the unit circle. The angle of each is 27/n from its neighbors, including 
the roots e? and e2"-)/"_ Therefore, plotting these n roots on the complex plane produces the vertices of a regular 
polygon with n sides. O 


Important: ~~ The n roots of 


Fa, 
Ha 
EH, 


e, ae ge poe TR e2(n—-Vni/n 


These are also referred to as the n™ roots of unity. 
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(a) @ (b) w-1 


Solution for Problem 7.21: 


(a) As we showed on page 212, if z is a root of a polynomial f with real coefficients, then Z is also a root f. 
Therefore if w is a root of f(z) = z" — 1, then so is @. In other words, if w is an n™ root of unity, then so is @. 


(b) We can quickly see that w —1 is not necessarily a root of f by noting that w = 1 is a root of f(z) = z” — 1, but 
this gives w — 1 = 0, which is clearly not a root of f. 


OPORTERET 


(c) Since w” = 1, we have 


Therefore, + is a root of f. 


Now that we know how to find n® roots of unity, we can find the n'™* roots of other complex numbers. 
| Problem 7.22: 
(a) At most how many different complex numbers have fourth powers equal to —8 — 8i V3? 
(b) Find all complex numbers whose fourth power is —8 — 8i V3. Plot all of them in the complex plane. 
| (c) How are your answers to part (b) related to the fourth roots of unity? 


(d) Let x = re. How are the n®™ roots of x related to the n roots of unity? 


Solution for Problem 7.22: 


(a) We seek the complex numbers z such that z* = -8 - 8i v3. Rewriting this as z+ + 8 + 8i V3 = 0, we see that 
we seek the roots of the polynomial z4 + 8 + 8i V3. Since this polynomial has degree 4, it has 4 roots. So, 
there are no more than 4 different complex numbers whose fourth power is —8 — 8i V3. (We don’t know for 
sure yet that there are 4, because we haven’t shown that the polynomial doesn’t have any repeated roots.) 


(b) We found the roots of unity by using the exponential form of complex numbers. Let's try the same here. 
We let z = re!?, where r > 0, be a solution to the equation z* = —8 — 8i V3, so we have 


re? = -8 — 8i V3. 


It’s not so clear what to do with that, so we write the right side in exponential form, too. This gives us 


; 1 V3 r 
A AiG _ ee oS Ani /3 
re = 6 ( 5 ) 16e : 


Comparing the magnitudes of both sides of te“? = 16e4"° gives us r* = 16, which gives usr = 2 (remember, 
r is a nonnegative real number). All that’s left is to find @ such that e“° = e*/°. Clearly, 0 = } works, and 


gives us the solution z = re™/> = 2 ( A+ i) = 1 +i V3; however, we expect to find four solutions, not just 
one. What are the others? 


We find the others by recalling that e“¢*?” = e'* for all angles a. So, 


pAni/3 — pil(4n/3)+2n] — gil(4n/3)}+4n] — pil(4r/3)+6r] — 
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This means that we can write the equation e“? = e479 as e#9 = ell(4"/3)+27K] where k is an integer. From this 
equation, we have 40 = = +2nk. Dividing by 4 gives 0 = 3 + z, When k = 0, we have the first solution we 
found, 0 = §. Letting k = 1,2,3 gives 0 = 57, “*, 422, respectively. When k = 4, we have 0 = 3 + 27, which 
gives us the same value of re? that k = 0 gave us. Similarly, any other value of k reproduces one of the four 
solutions we have already found, since increasing k by 4 increases 0 by 27, which leaves e? unchanged. 


Putting our four values of 0 together with r = 2 gives us the following Im 
four numbers whose fourth power is —8 — 8i V3: 


Dem/8 27/6 2e Jelini/6 
r Z i = 


In rectangular form, these are 
1+iV3, -V3 +i, -1-iV3, V3-i. = 


We have plotted these in the complex plane at right. Connecting them 
forms a square. 


(c) Seeing that the four fourth roots of —8 — 8i V3 are the vertices of a square in the complex plane makes us 
wonder if these roots are related to the roots of unity. In our solution to part (b), we found the desired 
values of O with the equation 0 = 7 + ak The angles ak for k = 0,1,2,3 are the angles of the fourth roots of 
unity. This allows us to express each of our solutions for re? in terms of one of the fourth roots of unity: 

2e(7i/3)+0 = Dem/3 7 d 


QelM4/3)+(7H/2) — peril? , gril? 


Perin _ eril , gri 


’ 


9 elt /3)+ Gri/2) = 2e™/3 E e2 


Aha! To produce the solutions of z* = 16e*“/>, we multiply each of the fourth roots of unity by 2e™’. 


(d) Part (c) gives us a guide for generating the n“ roots of any complex number re’’. First, we find the n“ roots 

of unity, which are e?"*/" for 0 < k < n. Next, we note that if z” = re’, then raising both sides to the power + 
gives z = “/re'®/" as one solution. We then multiply each of the n® roots of unity by this number to produce 
the n roots of re”. 


To confirm that this procedure does indeed produce the n values of z such that z” = re’, letz = re?l". w, 
where w is one of the 2 roots of unity. Since w is an n' root of unity, we have w” = 1, and 


z" = (re?! w) = CoE e "o" =r, 


so z is indeed one of the n? roots of re. Since no two of the n™ roots of unity have the same argument, 
multiplying each of these by ¥/re'/" produces n complex numbers that have n different arguments. In other 
words, no two of these n numbers are the same. Therefore, we have found the n different numbers whose 


n™® power is re!” 


h 


Problem 7.23: An n® root of unity, z, is called a primitive n™ root of unity if z" = 1 and z* # 1 for 
k =1,2,3,...,n—1. For example, -1 is a primitive square root of unity, since (-1)* = 1, but (-1)! + 1. 


(a) Find the primitive cube roots of unity. 
' (b) Find the primitive fourth roots of unity. 


| (c) Find the primitive sixth roots of unity. 


( 


d) Letn be a positive integer greater than 1. Prove that e?”/" is a primitive n' root of unity. 
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Solution for Problem 7.23: 


(a) By the definition of primitive roots of unity, if w is a primitive cube root of unity, then w° = 1, w + 1, and 
w? + 1. The three cube roots of unity are 1, e2"!/5, and e47/3, Clearly the first is not a primitive cube root of 
unity since we must have w # 1. We check the other two by computing their squares. We have 


(eae = g'il 41, 


( a T 


Therefore, both e?”/5 and e*™’/3 are primitive cube roots of unity. 


(b) The fourth roots of unity are 1, e”/, e”, e/?, The first and third are not primitive fourth roots because the 
first equals 1 and the square of e™ is 1. The other two are primitive fourth roots because for each one, the 
root, its square, and its cube all do not equal 1. So, the primitive fourth roots of unity are e™/? and &"/2, 


(c) The sixth roots of unity are 
1,3 eih oni Ani/3 Si/3. 


The first is not a primitive sixth root. Both e?”/3 and e#/> are cube roots of unity, so they are not primitive 
sixth roots. Similarly, e” is a square root of unity, so it’s not a primitive sixth root. If w = e™/, then af + 1 
for 1 < k < 5 (since w* for these values represent the 5 sixth roots of unity besides 1). To check the powers 
of 23, we let w = e°™/> and compute the first 5 powers of w. When the argument of the result is greater 
than 27, we can subtract integer multiples of 2x without changing the result. We find: 


w! = ee 
aw = e107i/3 2 ea 
ww = eidni/3 = eee 


wt = eil = grin 
4p) = Cane = E 

Not only are none of these five powers of w equal to 1, but all five are different. Interesting! We now know 
that e&”/3 is a primitive cube root of unity, and we have a clue that we'll investigate in Problem 7.25 for 
determining which n roots of unity are primitive. 


(d) In each of the previous parts, we saw that e”/" is an n™ root of unity. We can quickly prove that this is 


always the case by noting that if w = e/", then af = &"*/", Jf 0 < k <n, then the argument of w* is 
between, but not equal to, 0 and 27. Therefore, w* cannot equal 1. So, we conclude that w = e/" is a 
primitive n™ root of unity. 


Oo 


The first three parts above suggest a pattern regarding which roots are primitive and which are not. When 
n = 3, we found that e”*/" is a primitive n" root of unity for k = 1 and k = 2. When n = 4, we found that 2"*/” is a 
primitive n® root of unity for k = 1 and k = 3. When n = 6, we found that e”""/” is a primitive n™ root of unity for 
k = 1 and k = 5. We might guess that e*""/" is a primitive n™ root of unity if k = 1 or k = n — 1, and nota primitive 
root otherwise. Before we dive into proving that, let’s take a look at a fifth root of unity. 


Problem 7.24: Is e/5 a primitive fifth root of unity? 


Solution for Problem 7.24: We let w = e° and find the first four powers of w. As before, we can subtract positive 
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integer multiples of 27 from the argument of each result without changing its value: 


w! = ee 
a = el2ni/s = ae 
Ww = el 8ni/5 = oe 


wut = grils = eTii, 


Since none of these first four powers of w equals 1, but w = 1, we know that w = e> is a primitive fifth root of 
unity. Also, we see that these four powers of w produce the four fifth roots of unity besides 1. O 


The result of Problem 7.24 disproves our hypothesis that only e?"/" and e""-/" can be primitive n™ roots of 


unity, since we found that e?™/5 is a primitive fifth root of unity when k = 3. Back to the drawing board. Let’s look 
back at our results above and see if we can find some other pattern in which of the n roots of unity are primitive 
n™ roots of unity. 


In looking over the results of the past two problems, we see that e2nkiln ig a primitive n root of unity if k and 


n have no common divisors besides 1, and the root is not primitive otherwise. If we test all the eighth and ninth 
roots of unity, we'll find that this pattern holds. Moreover, we'll see the same pattern we saw when we discovered 
that the first four powers of ef"! produce the four 5" roots of unity besides 1, and that the first five powers of e°"!/ 
produce the five 6" roots of unity besides 1. These observations give us a strong clue how to quickly determine 
which nt roots of unity are primitive. 


Concept: Investigating examples can reveal patterns that help you conjecture and prove 
C= general results. 


We use the notation gcd(p, q) to refer to the greatest common divisor of p and 4q. 


} i if k 1 } . . oar t i f . 


(b) Show that if gcd(k,) + 1, then w is not a primitive n™ root of unity. 


(c) Let a and b be positive integers such that 0 < b < a < n. Show that if w is a primitive n™® root of unity, 
then it is impossible to have w* = w’. Why does this show that the first n — 1 powers of w are the n — 1 
n roots of unity besides 1? 


Solution for Problem 7.25: 


(a) Suppose that q is a positive integer such that œ” = 1, which means e?""/" = 1. Therefore, 2nkpi/n must be 
an integer multiple of 277, so kp/n is an integer. Since k and n have no common factors besides 1, the only 
way kp/n can be an integer is if n divides p evenly. Therefore, we can only have œw” = 1 if p is a multiple of 
n. This means that there is no p with 1 < p < n — 1 such that œ” = 1, so wis a primitive n" root of unity. 


(b) Westart just as in the previous part, with p being a positive integer such that œ” = 1, which means e?"?!/" = 1, 
Therefore, 27kpi/n must be an integer multiple of 27i, so kp/n is an integer. However, if gcd(k,) # 1, then 
some factors of n cancel with factors of k in the fraction kp/n, which means there is an equivalent fraction 
with a smaller denominator. When p equals this denominator, the fraction is an integer. Therefore, we can 
have a value of p smaller than n such that w? = 1. 


To make this argument more formal, we can let d = gcd(k,n). Since d divides n and k, we have n = dt 
and k = du for some positive integers t and u, so k/n = u/t. We can now write w as w = e™*/" = emilt Now, 
when we raise w to the t power, we have: 


anit : 
wt = Cae = ecm 
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Since u is an integer, we have e?™ = 1, which means «t = 1. Moreover, since n = dt and d > 1, we have 
0 < t < n, so œ' = 1 tells us that w is not a primitive n® root of unity. 


er : 1 \ ab 
If w = w, then dividing both sides by œ” gives w*” = 1. Since w™? = (ee ay = een we must 
have 
e2rk(a—b)i/n =i. 


Therefore, 2rk(a — b)i/n must be an integer multiple of 27i, which means k(a — b)/n must be an integer. 
However, since w is a primitive n™ root, we must have gcd(k,n) = 1. So, if k(a — b)/n is an integer, then n 
must divide a — b. But we have 0 < b < a < n, which means that 0 < a — b < n, so it is impossible for n to 
divide a — b. We have reached a contradiction, which means that our original assumption, œ” = w’, must 
be false. We conclude that no two of the first n — 1 powers of w are equal. 


Moreover, we can conclude that the first n — 1 powers of any primitive n™ 


roots of unity besides 1. To see why, we note that because w is an nt 


root of unity give us the n— 1 


th root of unity, we have w” = 1, so 


n 


CN so F =a 
which means w? is an n® root of unity for any p. Because w is primitive, none of the first n — 1 powers of w 
equals 1 and no two of the first n — 1 powers of w are equal. Therefore, these first n — 1 powers of w are the 
n® roots of unity besides 1. 


Important: For any positive integer n greater than 1, we have the following: 
| 
| W e The complex numbers 

el, gi gtin ga Mr e2(n-1)ni/n 


are the n different n™ roots of unity. 


th 


e If gcd(k,n) = 1, then w = e2nik/” is a primitive n™ root of unity, and 


sar ar eco 


_ re eee ee 


are all of the nt roots of unity. 


J rxercises i 


7a 
7.3.2 


Find all x that satisfy xê = 1. Which solutions are primitive eighth roots of unity? 


If the six solutions of x° = —64 are written in the form a + bi, where a and b are real, then find the product 


of the solutions witha > 0. (Source: AHSME) 


7.3.3 
7.3.4 


(a) 


(b) 
736 


Let w be a sixth root of unity. Find all possible values of w + w. 


Show that each 12 root of unity is a primitive k" root of unity for some positive integer k. What are the 
possible values of k? 


For each integer k in part (a), count how many 12 roots of unity are primitive kt roots of unity. 


Show that if w is a primitive n™ root of unity for n = 2m, then w is a primitive m" root of unity. 
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7.3.6 


(a) Let æ bea cube root of unity, and let £ be a fifth root of unity. Prove that af is a 15 root of unity. 


(b) Let abea primitive cube root of unity, and let $ be a primitive fifth root of unity. Prove that af is a primitive 
15" root of unity. 


(c) Let æ be a primitive fourth root of unity, and let $ be a primitive sixth root of unity. Is æf necessarily a 
primitive 24" root of unity? 


7.3.7* We say that a polynomial is irreducible over the rational numbers if it cannot be factored as the product 
of polynomials with smaller positive degree and rational coefficients. Find a degree 4 polynomial with leading 
coefficient 1 and rational coefficients such that the polynomial is irreducible over the rational numbers and all the 
polynomial’s roots are all twelfth roots of unity. (Source: HMMT) Hints: 182 


7.4 Problems Involving Roots of Unity 


a 


Problem 7.26: In this problem, we find the roots of the polynomial f(x) = x +x? +2% +25 4244+ 427 4x41. 
(a) What kind of series is the expression x? + x7 + xê + x + xt +x 427 +x4+1? 


- (b) Viewing f(x) as a series of the type you determined in part (a), find another expression for f(x). 


(c) Find the roots of f(x) by determining the values of x for which your expression in part (b) equals 0. (You 
can leave your answers in exponential form.) 


Problem 7.27: Find the roots of g(z) = zê + z4 +27 +1. 


| Problem 7.28: Let w = e*/’, Evaluate (2 + w)(2 + w*)(2 + a )(2 + w*)(2 + w°)(2 + w6). Hints: PLEO OED 


| Problem 7.29: Find all solutions to zf + z4 + z2? + z? + 1 = 0. (Source: AIME) Hints: 263 ai 


Problem 7.30: Find the sum of the n'™ roots of unity. 


Problem 7.31: Let n be a positive integer. In this problem, we evaluate the sum 


cos a e a He 5 en 
— te ee eee (@) LS 5 
2n+1 2n+1 2n+1 2n +1 
27 T 3-27 m-27 : PY 
(a) Evaluate the sum cos = + COs + cos fo COS , where m is a positive integer. 
(b) Evaluate the sum cos Z hen l Ps ea +++ + CO a where ni itive int 
valuate um cos n = o a + CC e nis a positive in è 
2n +1 2n +1 2n +1 2n +1’ P ah 


Problem 7.32: The equation 
x! + (13x-1)' =0 


has the 10 complex roots 11, 71, 12, 72, 13, 73, 14, Ya, 5, 75. Find the value of 


1 1 1 1 


a fe m 


mM to) tafs raf4 fsfs 


(Source: AIME) 
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Problem 7.26: Find the roots of the polynomial fx) sabe x’ POP Aa ee aT 


Solution for Problem 7.26: If we're very experienced with polynomials, we might recognize f(x) as a factor of x? — 1. 
But if we don’t see this right way, all is not lost. The polynomial f(x) is a geometric series; each term in the sum is 
produced by multiplying the previous term by a constant ratio. By writing f(x) as 


y= ee ee ee eee any 


we see that we can think of f(x) as a geometric series with first term 1 and constant ratio x. To simplify this sum, 
we multiply both sides by x, which gives 


E) ee ae ee ee ee 


Subtracting our equation for f(x) from our equation for x f(x) gives (x — 1)f(x) = x? — 1. What’s wrong with this 
finish: 


Bogus Solution: Dividing both sides by x—1 gives _ So 
fi) = Ee. 


If x? — 1 = 0, then f(x) = 0. The solutions to x? — 1 = 0 are the ninth roots 
of unity, so the roots of f are the 9 ninth roots of unity. 


Our first clue that this is incorrect is that f has degree 8, so it must have 8 roots, not 9. Where we went wrong 
is that the step from (x — 1) f(x) = x? — 1 to f(x) = — is only valid if x # 1. We must have x # 1 to avoid dividing 
by 0. When x = 1, our original expression for f(x) gives us f(1) = 9. When x # 1, we do indeed have f(x) = eal, 
so f(x) = 0 when x + 1 and x? — 1 = 0. Therefore, the solutions to f(x) = 0 are the ninth roots of unity except for 


x = 1. So, the roots of f are l l 
ea eae Bone ue goea. ge? ga et67i/9 
(El 


The key step in the solution to Problem 7.26 was noticing that the polynomial is a geometric series. This 
observation is particularly useful with polynomials of the form x" + x"! + x"? +-.-+1. 


Concept: Many problems involving polynomials of the form 


xe xt hg te td 
can be solved by writing the polynomial in the form 


xntl_ 4 
v=] í 


and using our knowledge of roots of unity. 


Solution for Problem 7.27: Asin Problem 7.26, the polynomial is a geometric series. It doesn’t have exactly the same 
form as the polynomial in Problem 7.26, but hopefully the same strategy will work. Multiplying g(z) = z°+z*+2z7+1 
by 2? gives 

z9(z) = 28 +29 +24 427. 
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Subtracting the equation for 9(z) from this gives (z? — 1)g(z) = zê — 1. If z? # 1, then we can divide both sides by 
z? — 1 to get 

2-1 
z2-1 

(We can also use the formula for summing geometric series to reach this expression for g(z).) Therefore, we have 
g(z) = 0 if zë = 1 and z* # 1, which means that the roots of g are the eighth roots of unity except for 1 and -1. 
These are 


g(z) = 


Ci gE, pee au CES eTil, 


oO 


Problem 7.28: Let w = e!"/7, Evaluate (2 + w)(2 + @)(2 + 0)(2 + w4)(2 + w°)(2 + w6). 


Solution for Problem 7.28: First, we recognize that w is a seventh root of unity, because œ” = (e4"/7)" = ef” = 1. 
Moreover, w is a primitive seventh root of unity, so w, w”, œw, wt, œw, and w® are the six seventh roots of unity 
besides 1. But how does that help? 


We could multiply the whole desired expression out. That will work, but it will take a fair amount of diligence 
and patience. Perhaps we can do something clever instead. Where else have we seen a product of binomials 
involving the seventh roots of unity? 


Concept: ‘When working with a difficult expression, think about where else you have seen 


O= similar expressions. 


We encounter the product of binomials involving the seventh roots of unity in the factored form of z” — 1: 


eG E E (2 — te aa 1G te), 


where the r; are the seventh roots of unity besides 1. We’ve already seen that these roots are w, w*, w, w*, w, and 
w® (in some order), so we have 


z’ -1=(z-1)(z-0)(z - @*)(z - w)(z — wt) (z — a) (z — wô). 
Aha! We can make the desired expression appear by letting z = —2: 
(—2)’ -1 = (-2 - 1)(-2 - w)(-2 - w)(-2 — w*)(-2 — w*)(-2 — w”) (-2 — wô). 
Factoring —1 out of each of the final six expressions in parentheses gives us 
—129 = —3(-1)°(2 + w)(2 + w)(2 + w )(2 + w*)(2 + w°*)(2 + wô), 


50 


43. 


(2 + w) (2 + w*)(2 + œ )(2 + w*)(2 + w”)(2 + w6) = = = 


Problem 7.29: Find all solutions to zô + z4 + z +z? + 1 = 0. (Source: AIME) 


Solution for Problem 7.29: If only the zf were just z, then we could find the roots easily; the roots of z4 +23 +z2 +z +1 
are the fifth roots of unity besides 1: 

2-1 

z-1° 


A447 +z+1= 
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Aha! Adding z° —z to both sides makes the left side the polynomial whose roots we seek, and we can factor the 
right side further: 


5 5 
+z” -z = —— 4+2(z2?-1 
z? =z Sem ( ) 


ZÉ +z +2 +2+1= 


_ z2 -1+z(z-1(z-1) 
R z-1 
_ (2 -241)(2 -1) 
7 z-1 l 
We can factor z° — 1 as (z — 1)(z4 + 2° + z2? + z + 1). The z — 1 cancels with the denominator and leaves 
A a E E 


The roots of z4 + z? + z? +z + 1 are the fifth roots of unity besides 1. We could use the quadratic formula to find the 
roots of 2° — z + 1, or we could note that z° + 1 = (z + 1)(2? — z + 1), so the roots of z? — z + 1 are the nonreal cube 
roots of —1. Since —1 = e™, the nonreal cube roots of —1 are e”/3 and &7/3. 


Therefore, the solutions to the given equation are e27/5, e47#/5, e6Ti/5 e8ni/5 | pni/3 e57i/3, p 


Sidenote: In hindsight, it isn’t surprising that the nonreal fifth roots of unity are also roots | 
N of zf +zt +z +2? +1. For any fifth root of unity, we have z° = 1, so 2® = 7 -z =z. 
Therefore, if z is a nonreal fifth root of unity, we have 


A be pa b dalaga a pa i 


which equals 0 because the nonreal fifth roots of unity are the roots of the polyno- 
mial zt +z’ +z? +z+1. 


—— 
|Problem 7.30: Find the sum of the n® roots of unity. j 


Solution for Problem 7.30: First, we write out the sum: 


giin di gemin J gz znin + eo2ni/n fe ek2ni/n rarer’ 2-1) 2ni/n 


Here are two ways to tackle this sum: 


Oni/n 


Method 1: It’s a geometric series! This sum is an n-term geometric series with first term e-™’" and common ratio 


e2T4/" so its sum is 
eoni/n pi eoni/n : ca x ie Prva $ T= ean 2 1—1 s 


1 — e2ni/n 7 Yemen © 1 — pera 1 — e2ni/n 


Method 2: Consider the polynomial z" — 1. The terms in the sum are the roots of z” — 1, because they are the n® 
roots of unity. We can see that the sum of the roots of this polynomial is 0 by writing the polynomial in the form 


(2 — r1)(2 — 12)(Z — 13) ++ (@ — tn), 


where the 7; are the roots of the polynomial. To form each term in the expansion of this product, we choose one of 
the terms in each binomial, and then take the product of all the chosen terms. The full expansion includes every 
possible product we can form in this manner. For example, when we take -rı from z — rı and take z from each of 
the other n — 1 binomials, we get —rz""'. Continuing in this way, the only way we can form a term with z""! is 
when we take —1; from one factor and z from each of the other n — 1 factors. Grouping all of these products gives 


us 
2°17 — 12 — 13 —- ++ — fn) 
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as the z”"! term in the expansion. However, the product (z-11)(z—72)(z—13)*** (2-1) equals z” — 1, and there is no 
z’~1 term in the polynomial z” —1. Therefore, the coefficient of z”! in the expansion of (z—11)(z-12)(z—T3) *-* @—Tn) 
must be 0. We just found that this coefficient is the opposite of the sum of the roots, so the sum of the roots is 0. 0 


|Sidenote: It’s not just the sum of the roots that we can quickly determine from the coefficients 
N of a polynomial. Consider the expansion of the product 


(z = ri)(Z — r2)(z — r3): - (Z — fn). 


| The highest degree term is z”. As just explained, the coefficient of the 2"! term 
| in the expansion is —(r1 + f +13 +++: +n). But what about the z”? term in the 
expansion? 
To form az"? term, we can take —r,; and —r2 from the first two factors, and z 
from the other n — 2 factors, which gives us r1r2z"77. We can do similarly to form 
every possible term rj7;z”* with i + j, so the z”? term in the expansion is 


(fe + tira trite P ian 


| Therefore, the coefficient of z”~? is the sum of all possible pairs of roots. 

| The coefficient of z”! is (—1)! times the sum of the roots. The coefficient of 
| z"~? is (-1)? times the sum of all possible products of two of the roots. Continuing | 
| in this vein, we can show that the coefficient of z”? is (-1)° times the sum of all 
| possible products of three of the roots, z” is (-1)* times the sum of all possible 


| products of four of the roots, and so on. 
| These relationships are often referred to as Vieta’s formulas, and they are 
__ covered extensively in Art of Problem Solving’s Intermediate Algebra book. 


Problem 7.31: Evaluate the sum cos 


+c E cae a il here n i iti 
-++ + cos ———, where n is a positive 

In+1 Int] 2n+1 ui P 

integer. 


k-2r 


Solution for Problem 7.31: The sum consists of terms of the form cos 
(2n + 1)® roots of unity. 


z7. These are the real parts of some of the 


ee 


Concept: Problems involving the real or imaginary parts of roots of unity can often be | 
solved using complex numbers, so keep an eye out for cosines or sines that are 
the real or imaginary parts of roots of unity. 


The given sum has the real parts of only n of the (2n + 1) 


sum, so we first look at a simpler problem. 


Concept: ~ We can often get hints for how to tackle a hard problem by first tackling a simpler 
Q= Version of the problem. 


th 


roots of unity. It’s not clear how to handle such a 


One simpler version of the problem is to consider the real parts of all the m 


2-27 3o Dit m- 270 
+ cos +-+-+c0S ae 


roots of unity: 


271 
cos — + cos 
m 


While we haven't tackled a sum like this, we have tackled a closely related sum—the sum of the n™ roots of unity 
themselves. We just showed that the sum of the m™ roots of unity is 0. Therefore, the sum of the real parts of the 
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th 


m`" roots of unity is also 0 (and so is the sum of the imaginary parts). This gives us 


Tl 3-27 m-2n 
+ cos —— +---+cos 
m 


= 0. 


27 
cos — + cos 
m 


How does this help us with our original problem? Letting m = 2n + 1 in the equation above produces 


27 1 AL EEL aa (2n+1)-2n _ 
n+ n+1 n+l > ne 


cos 


This contains all the terms in the desired sum, plus a bunch more terms: 


oR 2 ngs I een Ps ie eee: CA gg nt D-2m gga) 
on+1 on +1 n+l n+l n+1 S he 
ee 
Desired sum Extra terms 


We do at least know how to handle one of the extra terms: the last one simplifies to cos 27, which is 1. But 
what about the rest? 


Here are two methods for tackling the rest of terms and finishing the problem: 


Method 1: Pair up the angles in the sum. We can often handle complicated sums (or products) of trig expressions 
by exploiting relationships among the angles in the terms. Here, we can pair each angle in the desired sum with 


an angle in the “extra terms” such that the sum of the angles is 27. For example, 22; + ee a i 


3 ’ 2n+1 rf — ssh = 2T. 
Since cos(27 — 0) = cos 0, we have 


(n+1)- 27 (n+2)-2n | y cos ZRT cos 2 g cos AD 2% | is 1-27 
n+] on +1 me | eT n+ O° ee 


Hence, we can now write Equation (7.1) as 


2(cos =" + 22a +cos =T) +1=0 
se Or mri a 


from which we see that the desired sum equals -4. 


Method 2: Geometry. We can visualize the (2n + 1) roots of unity as points equally Im 

spaced on the circumference of the unit circle, starting with the root at 1, as shown at 
right. The picture tells the story. The roots above the x-axis correspond to the terms in 
the desired sum. The roots on and below the x-axis are the “extra terms.” The roots 
below the x-axis are the reflections of the “desired” roots over the x-axis, so the real part 
of each of these roots equals the real part of one of the “desired” roots. Therefore, the 
sum of the real parts of all the roots is twice the desired sum plus 1. Since the sum of the 
real parts of all of the roots must equal 0, we conclude that the desired sum is —}. o 


SS ee O 
Problem 7.32: The equation 


Re 


ree (kein = 10 
has the 10 complex roots 11, 71, 12, 12, 3, 13, 14, 74, 15, 75. Find the value of 


1 1 1 1 1 


— + — + — +- HM. 


mr, fer. arg T4f4 15S 


(Source: AIME) 
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Solution for Problem 7.32: We sure don’t want to multiply out (13x — 1)'°. Perhaps we can rewrite the equation in 
a more convenient form. Subtracting (13x — 1)'° from both sides gives 


10 — —(13x — 1)". 


This is close to an equation we know how to solve. We know what to do with x! = 1, and what to do with x” = c 
for some constant c. While we're not so sure what to do with x!° equal to an expression, we can rewrite this 
equation as the tenth power of an expression equal to a constant by dividing both sides of x = —(13x — 1)"° by 


(13x — 1)!°, which gives 
x \10 
T - u on 


(Note that we can take this step without worrying about dividing by 0 because x = 1/13 is clearly not a solution 
to x!0 = —(13x — 1)!°, since x = 1/13 makes the right side 0 and the left side nonzero.) 


Now, we have an equation that’s a lot more familiar. 


‘Concept: A great many equations can be solved by manipulating the equation into a form | 


you already know how to tackle. 


To make this equation exactly like an equation we have solved before, we can let t = zz, SO our equation is 
simply f!° = —1. We can now use Problem 7.22 as a guide to find t. We have t = —1 = e™, so the 10 solutions for 
t are e™/10 times each of the tenth roots of unity. Since we can express each tenth root of unity as e?""/19 for some 
integer k with 0 < k < 10, each solution for t has the form t = e7#/10 . e2"ki/10 = e@k+1)ni/10 for some integer k. 


We now must find x in terms of t. Multiplying both sides of t = 
Therefore, 


by 13x — 1 gives 13tx — t = x, s0 x = yy. 


ee 1 
i o(2k+1)ni/10 
13t—-1  13e@k+1)ni710 — 1] 


is a solution of the given equation for any integer k. 


—— 


That’s not a pretty expression, but we aren’t asked for the solutions of the original equation—we must evaluate 
an expression in terms of these solutions. Hopefully, once we plug these solutions into the desired expression, 
something convenient will happen. Let’s try the first term in the desired expression lfm = is one 


solution, then 
1 TAF a ) Pigiai ) 
i Po AS 


= 
(2 (= -1 ) 


A (13¢ - 1)(13# - 1) i 
tt 
_ 169f — 13t- 137 +1 
7 tt i 


la ias 
2 ND 13f-1 


Before we substitute for t, we notice that ff = |t? = 1 because t = e@#1)r#/10 and |e!| = 1 for any angle 9. 


Therefore, we have 5 7 ? 
1 169tt — 13t- 13t+1 169-13(t+t)+1 r 
Se ed eee O o 
1111 tt 1 


Since £ = e(@k+)ni/10 = e“2k+1)7i/10 we have 


a =4170= 13 ( e(2k+1yri/10 eneen) l 
nny 
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We still have two exponentials in this expression, but their exponents are opposites. We've seen this before, 
back in Problem 7.16, where we showed that cos 0 = (e? + e~)/2. So, we have e? + e~? = 2cos @. Applying this 
to our expression above gives us 

eS Were (e@krDni/10 4. e-Ch)nH/10)) = 170 - 26 cos ( 


(2k + a) 
rırı E i 


10 


This expression has 5 different possible values, one for each of k = 0,1,2,3, and 4. When k = 5, we have 
cos (Seu) = cos 4, which equals cos 77, so our expression for am is the same for k = 5 as it is for k = 4. 


Similarly, every integer k gives us a value of our expression ae ee equals one of the 5 values we find for 


nn 
k = 0 23 0r as 


Since there is a solution to x! + (13x — 1)! = 0 of the form x = e@)7#/10/(13¢@k+1)7#/10 _ 1) for each of 
the tenth roots of unity, we know that among the five terms of the form = each of the 5 possible values of 


170 — 26 cos (Sx x) occurs. Therefore, our desired sum equals 


TL 37 57 7T on 
5(170) — 26 (cos = + cos — + cos — + cos — +cos 25) : 
co) 10 10 10 10 10 
We're almost there! We have cos 57 = cos Z = 0. The other two cosines can be grouped in pairs whose angles sum 
to n, which reminds us that cos x = — cos(m — x). So, we have cos To + cos am = cos an + cos m = 0. Therefore, the 


sum of cosines in our expression for the desired sum is 0, so the sum equals 5(170) = 850. O 


(a) Find the roots of z? — z + 1 in exponential form. 
(b) Find the roots of z° + 1 in exponential form. 
(c) Find a degree 4 polynomial whose roots are the nonreal roots you found in part (b). 


(d) Find the roots of xf — x? + x* — x° + xX? — x + 1. (You can leave your answers in exponential form.) 
7.4.2 Let w be a 13" root of unity. Evaluate (1 - w)(1 - w?) -- - (1 - w1). 
7.4.3 Letz be such that z” = 1 and z # 1. Compute the numerical value of 


1 1 1 

10 30 50 

z“ + T0 +z™ + 0 +z™ + 0: 

(Source: NYSML) 

a+b- c, 
a—b+c 


m b @ ! 
7.4.4 The nonzero complex numbers a, b, and c satisfy r o. What are the possible values of 


7.4.5 Factor x’ +xt +x +x +1 into the product of two polynomials with integer coefficients and positive degrees. 


7.4.6x The polynomial 
P(x) = (eae od N — xl” 


has 34 roots of the form zy = r;,[cos(27a;,) + isin(27a,;)], k = 1, 2, . . . , 34, with 0 < a; < a2 < a3 < +- < a34 < land 
rų > 0. Find a; + a2 + a3 + a4 + a5. (Source: AIME) Hints: 209, 58 
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7.5 Summary 


The form x + yi is the rectangular form of a complex number. A complex number can also be written in the polar 
form r(cos 0 + isin 0) or the exponential form re®. We typically have r = 0 in polar and exponential form, so that 
r is the magnitude of the complex number. The @ in polar and exponential form is called the argument of the 
complex number. The rectangular form of a complex number is related to the polar and exponential forms by 
x =rcos0, y = rsin9, sor = x? + 4? and tan0 = * (when cos @ is nonzero). 


Important: De Moivre’s TI heorem states that ee w 


(cos 0 + isin)" = cosn@ + isinnO | 


for any 0 and any positive integer n. 


Important: For any angle 0, we wehave e ee. aaa | 
| , 
| TA eo + e710 | 
| 2 7 | 
| cif — g-i8 
in 0 = : | 
SS ee ee | 
me A IMM 
‘Important: The n roots of f(x) = x” — 1 are 
e, 


These are also aaen to as the n™ roots of unity, since these are the n complex | 


| 
0 ae ei PRR = pe 
j 
numbers whose n power is 1. | 


| 
| 
| 
| 
| 


—— 
Important: Ann root of unity, z, is called a primitive n™ root ot of unity if z” = 1 and z* #1 
for k = 1,2,3,.. pian 1. If A n) = 1, then we = e2nikin is a primitive n™ root of | 


3 n-1 


unity, and 1, w, w?, œ”, .. ., w”! are all of the n" roots of unity. | 


i] 


Things To Watch Out For! 


We always use radians in exponential notation, never degrees. 


Problem Solving Strategies 


Concepts: e Investigating examples can reveal patterns that help conjecture and prove 


O= general results. 


e When working with a difficult expression, think about where else you have 
seen similar expressions. 
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We can often get hints for how to tackle a hard problem by first tackling a 
simpler version of the problem. 


A great many equations can be solved by manipulating the equation into a 
form you already know how to tackle. 


De Moivre’s Theorem can help us tackle problems involving cos nO or sin n9, 
where n is a positive integer greater than 1. | 


Exponential form sometimes offers an easy way to prove facts involving 
complex number multiplication. 


The exponential form of complex numbers is a helpful way to remember 
r trigonometric identities. 


| e Many problems involving polynomials of the form x” +x") +x"? +---4+1 


| can be solved by writing the polynomial in the form ei, 
| knowledge of the roots of unity. 


and using our 


| e Problems involving the real or imaginary parts of roots of unity can often 
be solved using complex numbers, so keep an eye out for cosines or sines 
that are the real or imaginary parts of roots of unity. 


| REVIEW PROBLEMS a 
7.33 Write each of the following in polar form: 
(a) -9 (c) 8+8 
(b) V6+ V2i (d) -4-hi 
7.34 Write each of the following in rectangular form: 
(a)  4cis 120° (c) &cis$ 
(b) 5 cis 315° (d) 20cis97x 


7.35 For what values of 0 does cis 0 = cis 20? 


7.36 Determine all integer values of 0 with 0° < 0 < 90° for which (cos 0 + isin 0)” is a real number. (Source: 
ARML) 


7.37 Express each of the following in exponential form: 
(a) 24-24: (b) S854; 
7.38 Express each of the following in rectangular form: 
(a) e3mi/2 (b) 6e7 7/6 
7.39 Use the fact that e~? = ~, to reproduce the identities for sin(—6) and cos(—6). 


Z Z 
7.40 Describe all complex numbers z such that = + 2 = V3. 
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7.41 Evaluate each of the following: 


Ae Ne (-3 V3- 318 
a ( ) ©) C4 Vai 


(a) Express sin 50 in terms of sin 0. 


(b) Show that sin 46 cannot be expressed as a function of sin 0. 
7.43 Ifw-zisanonzero real number and |w| = |z|, then which of the following is true: 


(a) w+zis real. 
(b) w+zis either real or pure imaginary. 


(c) Itis possible that w + z is neither real nor pure imaginary. 
7.44 Find all solutions to z° = —4 V2 + 4 V2i in exponential form. 


7.45 Letz =e, where z + 1. Show that + = icot 2, 


(a) Find the product of the n™ roots of unity. 


th 


(b) Find the product of the primitive n™ roots of unity. 


7.47 Letn bea positive integer. How are the roots of x?™”*! + 1 related to the roots of x*”*! — 1 
(a) algebraically? 
(b) geometrically, in the complex plane? 


7.48 Let w be a nonzero complex number. Suppose the solutions to the equation z” = w are graphed in the 
complex plane. Prove that the resulting points are vertices of a regular n-gon. 


7.49 Let x, +iyn = (1+i V3)", where x, and y, are real and n is a positive integer. If x19y91 + X91y19 = 2k V3, 
compute k. (Source: ARML) 


7.50 Show that é = æ for all complex numbers z. 


7.51 The hyperbolic sine, hyperbolic cosine, and hyperbolic tangent functions are defined and denoted as 
follows: e E oe 
sinh x = ae : coshx = sii 7 tanh e 
2 P aes a 


Prove the following identities: 


(a) cosh? x —sinh* x = 1. 

(b) sinh(x + y) = sinh x cosh y + cosh xsinh y. 
tanh x + tanh y 

1+ tanh x tanh y 


(d) sinhx + sinh y = 2sinh 3% cosh =. 


(c) tanh(x+ y) = 


7.52 Ifx= =n and y = Ae then find all nonnegative integers n such that x” + y” = —1. 
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Challenge Problems 


7.53 The equation z° + z + 1 = 0 has one complex root with argument (angle) @ between 90° and 180° in the 
complex plane. Determine the degree measure of 0. (Source: AIME) 


7.54 Letzbearootof x -1 = 0, with z #1. Compute the value of 
zi} 4716 4 717 a 
(Source: ARML) 
7.55 Find all solutions to the equation z” + 8z* — 2iz? — 16i = 0. Hints: 39 
7.56 Let @ = arctan2. What quadrant does the angle 86 lie in? (No calculators!) Hints: 192 
7.57 Find all z such that z? + zê +z? = —1. Hints: 170, 283 
7.58 Describe all integers n such that the polynomial x2" + 1 + (x + 1)” is divisible by x? + x + 1. Hints: 254 
7.59 Find all z such that 32z° + 16z* + 8z? + 4z? + 2z + 1 = 0. Hints: 12 
7.60 Let x and y be two k® roots of unity. Prove that (x + y) is real. (Source: HMMT) Hints: 38 
7.61 If!” =1 and w # 1, then evaluate 
a 
1+w 1+? 1+ w19%7 
(Source: Mandelbrot) Hints: 214 


7.62 A sequence of complex numbers Zo, 21, Z2,.-- satisfies the rule 


va 
Z2n41 ==: 
Zn 


Suppose that |zo| = 1 and z2005 = 1. How many possible values are there for zo? (Source: AMC 12) Hints: 109, 70 


7.63 Thesequences 41, 42, 43, . - - and bı, bz, b3, ... are defined by a; = a,b, = B, and aņ+1 = A4n—Bbn, bn+1 = Bantaby 
for all n > 1. How many pairs of real numbers (a, f) are there such that aj997 = a1 and b1997 = bı? Hints: 152, 132 


(7) tan 0 — (3) tan? O+--: 


i- G a. . Hints: 16 


7.64 Show that tan n0 = 


7.65 Given that z is a complex number such that z + 1 = 2cos3°, find the least integer that is greater than 
27000 + ys. (Source: AIME) Hints: 257 


7.66x For a positive integer n, let D,(x) denote the polynomial whose leading coefficient is 1, and whose roots 
are the primitive n" roots of unity. These polynomials are called the cyclotomic polynomials. 


(a) Show that x” — 1 can be factored as Dg, (x)®z,(x)Pq, (x) --- Da, (x), where dı, d2, d3, . . . „dy are all of the positive 
divisors of n. Hints: 141 

(b) Find ©,(x) for 1 < n < 10. Hints: 256 

(c) Find ®,(x), where p is a prime. 

(d) Show that ifn > 2, then the degree of ®,,(x) is even. 

(e) Show that ®,(x) = ®,(—x) for all odd integers n > 2. Hints: 188 


S a Ia SUNT 
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D-O -Qer 
()-()=6)- Gentes 


7.68% If w = cos 40° + isin 40°, then express |w + 2w* + 3w? +--+ + 9w?|"! in the form asinb. (Source: AHSME) 
Hints: 165, 88 


7.67*x Show that 


and 


Hints: 272 


7.69% The sequence of functions {Tn} is defined by To(x) = 1, Ti(x) = x, and T,(x) = 2xTn-1(x) — Tn-2(x) for all 
n > 2. Show that T;,(cos 0) = cos né. 


7.70x The angles a, B, and y satisfy e" + e? + e” = 0. Show that e* = e? = 6”, Hints: 190, 77 
7.71x Show that 


cos x + cos y + cosz + cos(x + y +z) = 4cos (=) cos (==) cos (5) 


and 


+ + 
sinx + siny + sinz—sin(x + y + z) = 4sin (* =} sin (== 
Hints: 211, 130 


7.72x Find a cubic polynomial whose roots are cos 27/7, cos 47/7, and cos 67/7. Hints: 162, 25 


7.73% Let f(x) = x20 + 22003 + 32002 +... + 2004x + 2005 and z = cos (<5) +isin (z5). Express the product 


HOV) Ons TCR) 
in the form a’, where a and b are integers. Hints: 155, 73, 227 


7.74x Letv and w be distinct, randomly chosen roots of the equation z!””” — 1 = 0. Find the probability that 
\/2 + V3 < lv + w]. (Source: AIME) Hints: 139, 276 


7.75x Show that if n is a positive integer greater than 1, then 


o m a oi . (n-1)n n 
sin — SiN — sin ry 
n n n 21 


Hints: 37, 183, 142 
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Extra! What is i? 
a Ha FL 

De Moivre’s Theorem allows us to raise nonreal numbers to integer powers, and our work 
with finding roots of unity gives us some idea how to handle raising nonreal numbers to rational 
powers. But what about raising a nonreal number to a nonreal power? 

Let’s start simpler, by raising a real number to a nonreal power. We know how to raise e to a 
nonreal power, but what is 2'? The only number we know how to raise to the power i is e, so we 
write 2 as a power of e. By definition, the power that we must raise e to in order to get 2 is the 
base e logarithm of 2. We typically denote the base e logarithm as In, so, for example, Ine? = 3. 
Since we must raise e to the power In2 to get 2, we can write 2 as e™*, so we have 


2i = (el?)' = of?’ = cos(In2) + isin(In2). 


So, now we know how to raise positive real numbers besides e to nonreal powers—express the 
base of the expression as a power of e. 
We computed Jag by writing 2 as e12, So, to compute ï, we try writing i as e™!, which would 
allow us to write ï as (e"f. We don’t yet know what Ini is, but we do know Pea to write i asa 
power of e. Specifically, we have e”? = cos Ẹ + isin 3 = i, so we can choose to let Ini = ni/2, and 
we have : i 
E (ee) Sipe. se 


Weird! We raise i to the power i, and the result is a real number! 
But wait! We also have e/2 = i. So 


ja = ( Srila) = even? 


In fact we see that we could have gotten any number of the form e~™/2*4" 


There are a lot of possible values. What is going on here? 

We’d like to define i! such that it has only one value. Our work above seems to give many 
values of ï, so one of our steps may involve using a “function” that isn’t properly defined to give 
a unique output for each input. We’ve seen something like this before when we defined inverse 
trig functions. There, we had to restrict the domain of cos x in order to define an inverse. Let’s 
see if something similar is going on here. 

Remember we wrote i = e!"’. Raising e to a power is a function, since 


where k is an integer. ' 


Pet = e -e = & (cosh +isinb) = e cosb + “isinb, 


which definitely has only one possible value. But what about In? Since we wrote i = e"’, we're 

assuming that In is an inverse to f(x) = e*. In order for f to have an inverse, no two inputs can 

go to the same output. This is OK when x is real. When we allow nonreal inputs to f(x) = e*, we 
have a different story, because f(x) = f(x + 277). 

Now we see the problem! In order to build the function In we need to restrict the domain 
of f. (Remember this is exactly what we did in defining arccos, since cos itself did not have an 
inverse.) One of the most common choices for restricting the domain of f is forcing the imaginary 
part of the input to be in (~r, 77]. 

If we make this choice, then 

j= gilni = ellti/2) = enn? 
As you might guess, this is far from the end of the story. Defining In in this way introduces 
some unsettling consequences. For example, compare In e™ to In(e}00000017i). The first equals mi. 
We’d expect the second to be quite close to mi, but, no, it’s actually —0.999999971. Naturally, 
mathematicians have an answer for dealing with this oddity, too. Perhaps you'll study it in 
college when you study complex analysis or topology. 


en EE ———————ee———— 
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CHAPTER 8. GEOMETRY OF COMPLEX NUMBERS 


The shortest path between two truths in the real domain passes through the complex domain. ~ Jacques Hadamard 


CHAPTER a 


Geometry of Complex Numbers 


Many mathematical tools can be applied to a wide range of problems, including problems that seem very 
unrelated to each other. This chapter exhibits various deep relationships between two seemingly unrelated 
areas of mathematics: complex numbers and geometry. As you work through the following problems, keep in 
mind that many significant advances in mathematics have come from the realization that two seemingly different 
fields of mathematics are strongly related, or even the same. 


8.1 Transformations of the Complex Plane 


We start our investigation of the geometry of complex numbers with geometric transformations. 


e Translation. When we apply a translation to a figure, we slide it a specified distance B ~ 
ina given direction. For example, points A’ and B’ at rightare the result of translating ~~ = 
A and B, respectively, in the direction and by the distance suggested by the arrows. ~ A 
So, we say that A’ is the “image” of A under the translation shown at right, or that w, 
the translation “maps” A to A’. (We use the terms “image” and “maps” in this 
manner with all transformations.) 


e Rotation. When we rotate a figure, we spin it by some angle about some point. For 
example, in the diagram at right, the image of point A under a rotation of angle 0 f 
counterclockwise about point O is the point A’ such that OA = OA’ and ZAOA’ = @,as_ 1 
shown at right. If 6 is not an integer multiple of n, then there are two points that satisfy $ 
this description of A’, which is why we must specify the orientation of the rotation as_ | 0> O 
clockwise or counterclockwise. We call point O the center of this rotation, and 0 is the 
angle of rotation. 
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e Reflection. The image of point A upon reflection over line m is the point A’ such that m 
is the perpendicular bisector of AA’. In other words, if we fold our paper along line m, 
points A and A’ will coincide. If point A is on m, then it is its own reflection. We call m the 4 
axis of reflection. 


e Dilation. At right is an example of dilation. Point O is the center of a 
dilation. The image of A is the point A’ on OA such that OA’ /OA = 2. wie 
Similarly, the image of point B is the point B’ on OB such that OB’/OB = 2, A 
and the image of point C is the point C’ on OC such that OC’ /OC = 2. BS 
The image of the center of dilation is itself. We can think of the dilation O«=2--- ay 
as “stretching” AABC away from O, and we say that AA’B’C’ is a dilation ` 
of AABC about O. Bis 


In general, the image of a point P upon dilation with scale factor k and i 
center O is the point P’ on OP such that OP’ = k(OP) (when k is positive). 
The scale factor need not be positive. If the scale factor k is negative, then 
the image of P is the point P’ on the ray from O going in the opposite direction from P such that OP’ = |k|OP. 


We say that translations, rotations, and reflections are isometries, which means that they preserve distance. We 
can use this fact to show that a segment and its image under an isometry have the same length, and an angle and 
its image under an isometry have the same measure. A dilation is not an isometry, because it does not preserve 
length. It does, however, preserve angle measure, so, for example, ZABC = ZA’B’C’ in the dilation shown above. 


We sometimes wish to speak about geometric figures in the complex plane that are more complicated than 
points, such as segments or triangles. Doing so with the notation of complex numbers could be confusing. For 
example, “wz” typically refers to the product of complex numbers w and z, not the segment from w to z in the 
complex plane. So, we usually stick to the notation of geometry when referring to such geometric objects in the 
complex plane. By convention, we use capital letters to denote points and the corresponding lowercase letters to 
denote corresponding complex numbers. 


For example, in the diagram at right, point P corresponds to the complex number p Im 
and point Q corresponds to the complex number q. We refer to the segment that connects p 
p and q as PQ. While we usually use capital letters for any geometric object besides a 
point, we will sometimes refer to points in the complex plane by the complex numbers 
to which they correspond. So, we could describe the situation depicted at right as either 
“p is a 180° rotation of q about the origin” or “P is a 180° rotation of Q about the origin.” 


[___Probtems ii ‘ 


Problem 8.1: 
(a) What transformation maps any complex number z to the complex number z + 2 — 31? 


Re 


(b) What transformation maps any complex number z to z + w, where w is a constant complex number? 


Problem 8.2: 
(a) Ifz=cos12° +isin 12° + cos 48° + isin 48°, then what is the argument of z? 


(b) Find Im ((cos 12° + isin 12° + cos 48° + isin 48°)°). 
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Problem 8.3: If w and w — i are both n™ roots of unity, then what are the possible values ofn? 


Problem 8.4: 
(a) What transformation maps z to 2z? 


(b) What transformation maps z to cz, where c is a positive real constant? 


Problem 8.5: 
(a) How arez and iz related to each other in the complex plane? 


(b) Generalize your result from part (a) by finding a relationship between z and e'z in the complex plane. 


Problem 8.6: Describe in words the transformation that maps z to re! .z, where r is a positive real constant and 
0 is a real constant. 


Problem 8.7: Let g be the result of rotating p an angle 0 counterclockwise about w in 
the complex plane. In this problem, we find q in terms of 0, p, and w. 


(a) Let points P, Q, and W correspond to complex numbers p, q, and w, respec- 
tively. Consider the translation that maps W to the origin, O. Let this transla- 
tion map P to P’ and Q to Q’. Find the complex numbers that correspond to 
P’ and Q in terms of p, q, and w. 

(b) What is POQ’? 


Why are P’ and Q’ equidistant from the origin? 


Find q in terms of p, w, and 8. 


Problem 8.8: A pirate buries his treasure on a deserted island with an oak tree, a pine tree, and a gallows. 
To choose the burial location, he starts at the gallows and walks to the oak tree, counting his steps. When he 
reaches the oak tree, he turns right 90 degrees, walks the same number of steps as from the gallows to the oak 
tree, and places a spike in the ground. 

He returns to the gallows, and then walks to the pine tree, again counting his steps. When he reaches the 
pine tree, he turns left 90 degrees, walks the same number of steps as from the gallows to the pine tree, and 
places another spike in the ground. He then buries the treasure at the midpoint of the segment with the spikes 
as endpoints, and then removes the spikes from the ground. 

Years later, the pirate returns to the island to get his treasure. Unfortunately, the gallows are gone, and there 
is no sign of where it had been. Fortunately, the oak tree and pine tree are still there. In this problem, we help 
the pirate find his treasure. We do so by putting the problem on the complex plane. 

(a) Let g correspond to the gallows, p to the pine, and k to the oak. Find complex numbers in terms of g, p, 
and k that correspond to each spike and to the treasure. 


(b) When we place a geometry problem in the complex plane, we can choose where to place the origin. 
Suppose we choose the midpoint of the segment connecting the two trees as our origin. Then, how are p 
and k related? Why is this choice of origin so convenient? 


Where is the treasure? 


Problem 8.1: 
(a) What transformation maps any complex number z to the complex number z + 2 — 3i? 


| (b) What transformation maps any complex number z to z + w, where w is a constant complex number? 
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Solution for Problem 8.1: 


(a) We investigate by trying a few values of z. At right, we have 
plotted 2 + 4i, -3 + 51, and —2i, and the results when we add 2 — 3i 
to each. In each case, the result of the addition is 2 units to the 
right and 3 units below the original complex number. Writing z 
as a + bi makes it clear why z + 2 — 3i is a translation of z: 


Z+2-3i=at+bhi+2-3i=(a+2)+(b-3)i. 


The effect of adding 2 — 3i to z is to add 2 to the real part (move 2 
units right) and subtract 3 from the imaginary part (move 3 units 
down). Therefore, the result is a translation of z by 2 units right 
and 3 units down. 


(b) From our work in part (a), we aren’t surprised to find that the 
transformation that maps z to z + w is a translation (even if w = 0, 
which gives us the “do nothing” translation). If we let w = c + di, we see that z + w adds c to the real part of 
z (moves c units horizontally) and adds d to the imaginary part of z (moves d units vertically). Therefore, 
z + W is a translation of z by c units horizontally and d units vertically. 


Solution for Problem 8.2: We could start by raising that whole expression to the sixth power, but that looks difficult. 
The angles are conveniently related, so we might try trig identities. This approach will work, but thinking about 
the problem geometrically gives a much faster solution. 


Letting w = cos 12° + isin 12° and z = cos 48° + isin 48°, our problem is to determine 
Im ((w + z)°). We know how to find w + z in the complex plane when we have w and z. 
Specifically, we let points W, Z, and S in the complex plane correspond to w, z, and w+ z, 
respectively, as shown at right. As we saw in Problem 8.1, adding a constant complex 
number w corresponds to a translation in the complex plane. Adding w to 0 and to z 
give w and w + z, respectively. So, the same translation that maps O to W also maps Z to 
S. Therefore, OW = ZS and OW || ZS, which means that OWSZ is a parallelogram. (We 
also saw this when we introduced graphing on the complex plane in Section 6.2.) Since 
jw] = |z|, we have OZ = OW, so this parallelogram is also a rhombus. 


Since OWSZ is a rhombus, diagonal OS bisects ZZOW. From the arguments of W and Z, we know that 
LWOA = 12° and ZZOA = 48°, so LWOZ = 36° and ZSOW = ¿WOZ = 18°. This means that ZSOA = 30°, which 
is an angle we like. Since ZSOA = 30°, the aument of w + z is 30°, which means we have w + z = re™/® for some 


positive constant r. Therefore, (w +z)É = (re™ y = fe" = rê(—1) = —1°. We don’t know exactly what the real part 
of this number is, but we know its imaginary part is 0, so Im((w + 29 F0 el 


Our solution to Problem 8.2 looks a lot longer than it really is. Once you’re comfortable with the geometry of 
complex numbers, the observation that w + z has argument 30° is almost immediate, which makes determining 


that Im((w + z)°) = 0 very easy. 


Let’s take another look at a problem that is simplified by thinking of adding complex numbers as a translation. 


Problem 8.3: If w and w —iare both n™ roots of unity, then what are the possible values ofn? 
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Solution for Problem 8.3: If we start algebraically, we have œ” = (w — i)” = 1. Um, then what? We sure don’t want 
to expand (w — i)". We might write w and w —iin polar form and try to bash away with sines and cosines (you can 
take a swing at that on your own), but before we do that, let’s think about the problem geometrically. 


We let points A and B in the complex plane correspond to w and w —i, respectively. Since w and w —i are both 
roots of unity, both are on the unit circle. The point w — i is a one-unit downward translation of w, so AB is vertical 
with AB = 1. Moreover, since A and B are on the unit circle, we have OA = OB = 1, which means that AOAB is 
equilateral. Therefore, ZAOB = 60°. 


Since AB is vertical and no two points on the unit circle above the real axis are on the 
same vertical line, we know that A and B cannot both be above the real axis. Similarly, 
they cannot both be below it and neither can be on it, so one is above the real axis and 
the other is below it. Since B is below A, we know that A is above the real axis and B is 
below it. The top half of the unit circle is the reflection of the bottom half over the real 
axis, so the real axis must bisect ZAOB. This tells us that AO makes a 30° angle with the 
real axis, limiting A and B to the two possibilities shown in the diagram, with points A, 
and B; forming one possible pair and points Az and Bz forming the other possible pair. 


eom/6 


For the first possibility we have A; = e"il6 and B, = e-™/®, and for the second we have A, = and 


B2 = e “4/6, Now, we're ready to find n. 


If w = e™/® is an n™ root of unity, we must have (e/ 6)" = 1, so e™/6 = 1. This means that min/6 must be an 
integer multiple of 27, so min/6 = 271k for some integer k. Solving for n gives n = 12k, which means that n is a 
multiple of 12. Similarly, if w = e7™/6, then we have (e5"/6)" = 1. This gives us 5rin/6 = 2nik for some integer k, 


from which we find 5n = 12k. Once again, this means that n is a multiple of 12. 


We must still confirm in both cases that w — i is an n™ root of unity when n is a multiple of 12. When w = e™6, 
we have w —i = e-™/® and 
wots Guy a3 (emire) ™ a ai, 
as desired. Similarly, when w = 0/6, we have w — i = e™/* and 


o ( a) = ( play -ginik _ 7 
(We could also have noted that in both cases, we have w — i = w, and we showed in Problem 7.21 that if w is a root 
of unity, then so is œ.) 


We can also use geometry to visualize the fact that n must be a multiple of 12. Our options for A and B are all 
primitive 12" roots of unity, so the first 12 powers of each form the vertices of a regular dodecagon in the complex 
plane, as shown at left below. 


Im Im Im 


Re Re Re 


Figure 8.1: 12 Roots of Unity Figure 8.2: 24" Roots of Unity Figure 8.3: 36" Roots of Unity 


The diagrams above exhibit how the 12" roots of unity are related to the 24" and 36" roots of unity. Similarly, 
whenever n is a multiple of 12, the n™ roots of unity include the 12" roots of unity. Moreover, if a regular n-gon 
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with all its vertices on the unit circle includes these 12 roots of unity among its vertices, then n must be a multiple 


of 12, because there will be an equal number of vertices of the n-gon between each two consecutive 12" roots of 
unity. O 


Problem 8.4: 
(a) What transformation maps z to 2z? 


(b) What transformation maps z to cz, where c is a positive real constant? 


Solution for Problem 8.4: 


(a) We start by exploring a few examples. In the diagram at right, we hth 
have 2 + i, -3 + 2i, and 1 — 2i, and the results of multiplying each ~ 2-3 +2i) 
of these by 2. It appears that in each case, the result 2z is on the ray . 
from the origin through z, and that 2z is twice as far from the origin >s 
as z is. We can see why this occurs by looking at the exponential 
or polar form of z. Writing z as re®, we have 


2z = 2re'®, 


"e 


> e are: 
9ta 2+- 2(2 +1) 


So, the magnitude of 2z is twice the magnitude of z, and the argu- 
ment of 2z is the same as that of z. Because the arguments of 2z and 
z are the same, they are on the same ray from the origin. Therefore, 
we conclude that 2z is a dilation of z about the origin with scale 
factor 2. 


(b) Our previous part gives us a clear guide. Writing z in exponential form gives us cz = cre? , so the magnitude 
of cz is c times the magnitude of z, and the argument of cz is the same as that of z. Therefore, cz is on the ray 
from the origin through z, and is c times as far from the origin as z, which means cz is a dilation of z about 
the origin with scale factor c. 


o 


If c is negative, then the transformation that maps z to cz is still a dilation; it just has a negative scale factor, 
which means that cz is in the opposite direction from the origin from z. 


Problem 8.5: 
(a) How are z and iz related to each other in the complex plane? 


(b) Generalize your result from part (a) by finding a relationship between z and e'z in the complex plane. 


Solution for Problem 8.5: 


(a) First, we experiment with a few values of z. If z = 1, then iz = i. If 
z = 1+5; then iz = i(1+5i) = -5+ i. If z = —4- 2i, then iz = i(—4 — 2i) = 
2 — 4i. The latter two examples are depicted at right. In each of these 
three cases, we see that iz is a } counterclockwise rotation of z about 
the origin. 
Now that we know what to look for, we have a pretty strong clue 
how to prove it. Angles and complex numbers suggest using expo- 
nential form. Letting z = re? and noting that i = e"/*, we have 


ree - re? = reil9+3), 


Therefore, iz is just as far from the origin in the complex plane as z is, 
but its argument is 4 greater. That is, iz in the complex plane is a | counterclockwise rotation of z about 
the origin. 
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(b) The previous part gives us a guide. We let z = re?, and we have 


ia 


en =e pee ae 


So, ez and z are the same distance from the origin, but the argument of ez isa greater than the argument 
of z. Therefore, e'*z is the result of rotating z counterclockwise about the origin by an angle of a. 


When z is rotated a counterclockwise about the origin in the complex plane, 
the result is e’“z. 


Important: 


Y 


In Problems 8.1, 8.4, and 8.5, we explored a specific case of a transformation and then used our specific case 
as guidance to find a more general result. This is a very common and important strategy for finding proofs of 
general statements. 


| Concept: Many general results can be discovered by first exploring simple examples. | 


Problem 8.6: Describe in words the transformation that maps z to re” - z, where r is a positive real constant 
and @ is a real constant. 


Solution for Problem 8.6: The transformation that maps z to re? -z is simply a combination of the transformations 
we learned in Problem 8.4 and Problem 8.5. From Problem 8.5, we know that e?z is the result of rotating z an 
angle of 0 counterclockwise about the origin. From Problem 8.4, we know that r - (e?z) is the result of dilating 
e?z by a scale factor of r from the origin. Therefore, the transformation that maps z to re’? -z is a counterclockwise 
rotation by an angle 0 followed by a dilation with scale factor r. O 


Saale 8.7: If q is the result of rotating p an angle 0 counterclockwise about w in the complex plane, then | 
find q in terms of 6, p, and w. 


Solution for Problem 8.7: Let Q be the image of a counterclockwise rotation of P about Im 
W by 9, as shown, and let points P, Q, and W correspond to complex numbers p, q, 
and w, respectively. The only rotations we know how to handle are rotations about the ie 

origin. So, we turn this problem into a problem we know how to handle by translating ye P 
the center of rotation, W, to the origin. We apply the same translation to P and to Q, wN 
and we have the diagram at right, where P’, Q’, and W’ are the images of P, Q, and W 
under the translation that maps W to the origin. 


d 
The translation that maps W to the origin must map the complex number w to 0. ve Re 
Therefore, this translation maps any complex number z to z — w. Since P’ is the image of P under this translation, 
P’ corresponds to the complex number p’ = p — w. Similarly, we have q’ = q — w, where g’ corresponds to Q’. 


Next, we note that since Q is the image of P upon rotation about W, we have QW = WP. Since translating 
a segment does not change its length, we have Q’W’ = W’P’ as well. Translating an angle does not change its 
measure, so ZQ’W’P’ = ZQWP = @. Since Q’W’ = W' P’ and ZQ’W'F” = 6, point Q is the image of point P’ upon 
a counterclockwise rotation of angle @. 


The image of z upon a counterclockwise rotation by @ about the origin is ez. Since Q’ is the image of P’ under 
such a rotation, we have q’ = ep’. Substituting our expressions above for p’ and q’, we have g — w = e° (p — w). 
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Solving for q gives 
g=e(p—w)+w. 


If q is the result of rotating p about w in the complex plane counterclockwise by 
an angle of 0, then 


q= (p -w) +w. 


S | 


o 


Important: 


This is often easier to visualize in the form 
q-—W= e? (p = w), 


for the reasons we discussed in our solution to Problem 8.7: q — w and p — w are the images under the translation 
that maps w to the origin, and e° (p — w) is the image upon rotating p — w by @ about the origin. 


[Problem 8.8: A pirate buries his treasure on a deserted island with an oak tree, a pine tree, and a gallows. 
| To choose the burial location, he starts at the gallows and walks to the oak tree, counting his steps. When he 
| reaches the oak tree, he turns right 90 degrees, walks the same number of steps as from the gallows to the oak 
| tree, and places a spike in the ground. 

He returns to the gallows, and then walks to the pine tree, again counting his steps. When he reaches the 
pine tree, he turns left 90 degrees, walks the same number of steps as from the gallows to the pine tree, and 
‚places another spike in the ground. He then buries the treasure at the midpoint of the segment with the spikes 
|as endpoints, and then removes the spikes from the ground. 
| Years later, the pirate returns to the island to get his treasure. Unfortunately, the gallows are gone, and 
| there is no sign of where it had been. Fortunately, the oak tree and pine tree are still there. How can the pirate 
| find his treasure? 


Solution for Problem 8.8: We let G, P, and K be the gallows, the pine, and the oak, Si 
respectively. We also let S; and S2 be the spikes, and T be the treasure, as shown in the 
diagram. 


Because GK = KS, and ZGKS, = 90°, the first spike is a 90° counterclockwise rotation 
of the gallows about the oak tree. (It’s a counterclockwise rotation because the pirate c 
turns right when he reaches the oak tree from the gallows.) We know how to handle 
rotations on the complex plane. Moreover, 90° rotations are particularly easy to handle P 
with complex numbers. So, we place the problem on the complex plane. We let the lowercase letters of each of 
the point labels be the corresponding complex numbers. 


Because S4 is a 90° counterclockwise rotation of G about K, we have 
sı =k+(g—ke™? =k+(g—-hi. 


Similarly, the second spike is a 90° clockwise rotation (because the pirate turns left at the pine) of the gallows about 
the pine, so 7 
=R 


s2 =p+(g—pe™” =p + (g - p)(-ì). 


To find the treasure, we have to figure out how to find the midpoint of the segment connecting two points on 
the complex plane. Fortunately, this is pretty easy. The midpoint of the segment connecting (x1, yi) and (x2, y2) in 
the Cartesian plane is ( ax, nt) . In the complex plane, this means that the midpoint of the segment connecting 


2 
zı =X1+ yi and z2 = X2 + yal is 

4 (= 2) 4 (45%); 

2 2 ý 
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which equals (zı + 22)/2. Intuitively, this makes sense—it tells us that the midpoint of the segment connecting two 
complex numbers in the complex plane is the average of the two numbers. 


Applying this to our problem to find t, the complex number corresponding to the treasure, we have 


mita k+(g-ki+p+(g-p(-) _k+gi-ki+p-gi+pi _k+p n pak | 
—— 2 p 2 “oD 2 

There’s no g in this expression—that means that the location of the treasure doesn’t depend on the gallows at all! 

It only depends on the pine and oak trees, which means that the pirate can still find the treasure. 


But how? How can he find the point that corresponds fe = = i? One quick way he can do that is to choose 
his origin wisely. 


‘Concept: When placing a geometric problem on the complex plane, we can choose where 


Ose toplacethe origin. Choosing a convenient point can often simplify the problem 
| a ; 


We recognize the first term in the expression = Sf. pe -1, the point corresponding to 


Ei is the midpoint g the segment connecting Ti a So, if we choose this Apo K 


< be the origin, then “ 2 = 0, and the treasure is simply at ee. -1. Moreover, since ae =0, 
- : p) 


we have k + p = 0, so i= = —p. Therefore, the treasure is at -1, which simplifies to 
pi. Aha! The point corresponding to pi is a 90° couat wise rotation of p about the 
origin. 


We now have a plan to find the treasure. We find the midpoint of the segment 
connecting the trees. We measure the distance from that point to the pine tree. We stand at the midpoint (the 
origin of our complex plane), face the pine tree, turn 90° counterclockwise, and then walk that measured distance 
in the direction we face. Then, we dig. 


And we find the treasure. 0 


On the way to finding the treasure, we also discovered an unsurprising geometric relationship on the complex 
plane. 


Important: « The midpoint of the segment connecting z and Z2 in the complex plane is ne q 


8.1.1 Suppose w is the image of z upon a 180° rotation about the origin. Express w in terms of z. 
8.1.2 


(a) What is the image of 3 + i upon a 30° counterclockwise rotation about the origin? 


(b) What is the image of 3 + i upon a 120° clockwise rotation about 1 + 2i? 


8.1.3 Note that argument of w + z is average of the arguments of w and z in Problem 8.2. Is this always true? If 
not, when is it true? 


8.1.4 Describe in words the transformation that maps z to z/ (re), where r is a positive real constant and 9 isa 
real constant. 
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8.1.5 For each of the following functions, describe the corresponding transformation that maps z to f(z). 
(a) f(z)=z-5+2i. 
b) f(z) =32-1. 
© fo = (4+ ¥i)z 
(d) f(z) =(-1 + i)z. 
(e) f(z) = -iz -1 + 3i. 


8.1.6 Suppose that U is on WZ in the complex plane such that UW/UZ = a/b. Show that if u, w, and z are complex 
numbers corresponding to U, W, and Z, then u = (az + bw)/(a + b). 


8.1.7x Let zı be a complex number. Let z2 be the complex number obtained when 2; is rotated counterclockwise 
about the origin by +, and let z3 be the complex number obtained when 22 is rotated counterclockwise about 4 + 3i 
by 5. Then z3 is the same as the point obtained when z; is rotated counterclockwise about w by @. Find w and 8. 


8.2 Parallel and Perpendicular Lines 


"Problems tet 


Problem 8.9: Let W and Z be points in the complex plane corresponding to nonzero complex numbers w and z, 
respectively, and let O be the origin. 


(a) Show that if Wz passes through O, then z/w is a real number. 
(b) Show that if z/w is a real number, then WZ must pass through O. 


Problem 8.10: Suppose S, T, and U are points in the complex plane corresponding to distinct complex numbers 
s, t, and u, respectively. Three points are said to be collinear if a line passes through all three points. In this 
problem, we show that these three points are collinear if and only if (s — #)/(u — t) is real. 

Let O be the origin. Consider the translation that maps T to the origin. Let S’ and U’ be the images of § 
and U, respectively, under this translation, and let the complex numbers s’ and u’ correspond to S’ and U’, 
respectively. 

(a) Explain why S, T, and U are collinear if and only if S’, O, and U’ are. What must be true of s’ and w’ if 
and only if S’, O, and U’ are collinear? 


(b) Show that S, T, and U are collinear if and only if (s — £)/(u — t) is real. 


Problem 8.11: Let AB and CD be segments in the complex plane, and let a, b, c, and d be complex numbers 
corresponding to the endpoints of these segments. Show that AB || CD if and only if (b — a)/(d — c) is real. 


Problem 8.12: Use complex numbers to show that a quadrilateral is a parallelogram if and only if its diagonals 
bisect each other. 


Problem 8.13: Let AB and CD be segments in the complex plane, and let a, b, c, and d be complex numbers 
corresponding to the endpoints of these segments. Find a condition similar to that in Problem 8.11 that is 


necessary and sufficient to show that AB 1. CD. 
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Problem 8.14: Let AB and CD be segments in the complex plane, and let a, b, c, and dbe complex numbers 
corresponding to the endpoints of these segments. 


- (a) Show that AB || CD if and only if (a - b)@ - d) - @- D)(c - d) = 
(b) Show that AB CD if and only if (a — b)(@ - d) + (@- DX - d) = 0. 


Problem 8.15: Suppose VWXYZ is a pentagon such that the altitudes from V, W, X, and Y to the respective 
opposite sides of the pentagon (V to XY, W to YZ, etc.) meet at a common point P. In this problem, we show 
that the altitude from Z to WX passes through P as well. 


(a) Write four equations corresponding to the four given altitudes. (Tip: Choose your origin wisely!) 
(b) Write an equation that must be true if the altitude from Z to WX passes through P. 
(c) Combine your equations in part (a) to produce the equation in part (b). _ 


Problem 8.9: Let W and Z be points in the complex plane corresponding to nonzero complex numbers w and 
z, respectively, and let O be the origin. Show that WZ passes through O if and only if z/w is a real number. 


Solution for Problem 8.9: We must show both of the following: 


(a) If WZ passes through O, then z/w is real. 
(b) If z/w is real, then WZ passes through O. 


Important: Suppose P and Q are mathematical statements. The statement “P if and only if 
Q” means both of the following: 
e If P is true, then Q is also true. | 
| 
e If Q is true, then P is also true. 


Both of these statements must be proved in order to show that “P if and only 


if Q” 


We'll prove parts (a) and (b) separately. We’ll start with part (a). Since W and Z are on the same line through 
the origin, we know that we can transform W to Z by performing a dilation about the origin (possibly with a 
negative scale factor). Therefore, there is some real constant c such that z = cw, from which we have 2 =c. 


Next we must take care of part (b). If = = c, where c is a real number, then we have z = cw. Since c is real, this 
means that Z is the image of W under some dilation about the origin, which means that Z is on OW, as desired. 


We also could have tackled part (b) by noting that if w = re’, then z = cre’®. If c > 0, then W and Z have the 
same argument. Ifc < 0, then their arguments differ by 7. In both cases, W and Z lie on the same line through the 
origin. O 


We say that three points are collinear if they lie on the same line. Now that we know how to tell if the line 
through two given points in the complex plane passes through the origin, let’s see if we can develop a similar test 
to determine if any given three points are collinear. 


| Problem 8.10: Suppose 5, T, and U are distinct points in the complex plane corresponding to complex numbers 
s, t, and u, respectively. Show that S, T, and U are collinear if and only if (s — t)/(u — t) is real. 
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f 


Solution for Problem 8.10: Just as we did with the general rotation problem in 
Problem 8.7, we tackle the general problem of collinearity by using translations 
to turn it into a problem we already know how to solve. We know how to tell if 
the line through two points passes through the origin, so we translate all three 
points such that the image of one of these points is the origin. The images of S, 
T, and U are collinear if and only if S, T, and U are. (We can see this by noting 
that applying a translation to ZSTU will not change the measure of the angle.) 


The expressions s — f and u — £ in (s — £)/(u — t) suggest translating T to the 
origin, since such a translation maps S and U to points S’ and U’ such that 
s’ =s—tandw’ = u-t. Letting O be the origin, we can now apply our result from Problem 8.9. There, we showed 


that FU passes through the origin if and only if s’/w’ is real. Substituting our expressions for s’ and u’, we see 
that FU passes through the origin if and only if (s — £)/(u — t) is a real number. Since S$’, O, and U’ are collinear if 
and only if (s — £)/(u — t) is real, and S, T, and U are collinear if and only if S’, O, and U’ are collinear, we conclude 
that S, T, and U are collinear if and only if (s — f)/(u — t) is real. O 


The diagram at right above is not the only possible configuration; it is possible 
that T is on SU, but not between S and U. However, the fact that Sip passes 
through the origin if and only if (s — f)/(u — t) is real does not depend upon the 
possible orderings of S’, U’, and O. So, we don’t have to worry about different 
cases. 


Suppose S,T, and U are distinct points in the complex plane corresponding to 
complex numbers s, t, and u, respectively. S, T, and U are collinear if and only 
if (s - t)/(u - t) is real. = 


We'll now turn to parallel lines. In addition to the usual definition that two lines are parallel if they do not 
intersect, we will also say that a line is parallel to itself. We can think of this as “all lines with the same direction 
are parallel.” We will make the concept of “direction” of a line more rigorous in Section 9.3. 


Problem 8.11: Let AB and CD be segments in the complex plane, and let a, b, c, and d be complex numbers 
corresponding to the endpoints of these segments. Show that AB I CD if and only if (b — a)/(d —c) is a real 
number. 


Solution for Problem 8.11: There’s no need to reinvent the wheel here; we have 
a pretty clear guide how to tackle this problem in our solution to Problem 8.10. 
We start by translating AB such that the image of A is the origin, O. Let the 
image of B under this translation be B’, so that b’ = b—a. Similarly, we translate 
CD such that the image of C is the origin and the image of D is D’, which gives 
us d’ = d—c. Since OB’ || AB and OD’ Il To we have AB Il ED ifand only if OB’ 
and OD’ are the same line, because there is only one line through O parallel to 
4B. In Problem 8.9, we saw that OB’ and OD’ are the same line if and only if 
b’ /d’ is real. Substituting our expressions above for b’ and @’, this tells us that 
OB’ and OD’ are the same line if and only if (b — a)/(d - c) is real. Every step 
in our reasoning is an “if and only if” step, so we have proved that AB | ED if 
and only if (b — a)/(d — c) is a real number. O 


~ Let AB and CD be segments in the complex plane, and let a, b, c, and d be 
complex numbers corresponding to the endpoints of these segments. Then 
lines AB and CD are parallel if and only if (b — a)/(d-— c) is a real number. | 


i Important: 


Ky, 
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‘Problem 8.12: Use complex numbers to show that a quadrilateral is a parallelogram if and only if its diagonals § 
bisect each other. 


Solution for Problem 8.12: Let complex numbers a, b, c, and d correspond to the vertices of quadrilateral ABCD in 


the complex plane. Then, the midpoint of diagonal AC is “° and the midpoint of diagonal BD is +44. 


First, we show that if the diagonals of ABCD bisect each other, then ABCD is a parallelogram. If the diagonals 
of ABCD bisect each other, then the midpoints of AC and BD are the same. That is, we have 


a+c _ b+d 
e ha 


soa +c = b+ d. We'd like to show that AB || DC, which we can prove by showing that (b — a)/(c — d) is real. So, 
we rearrange a +c = b +d asc -d = b-a. Dividing both sides by c — d gives ta = 1. Since this is real, we have 
AB || DC. We could go through essentially the same steps to show that AD || BC, or we could note that since 
c-d = b-a, we have |c — d| = |b — al, which means that DC = AB. Combining AB = DC with AB || DC tells us that 
ABCD is a parallelogram. 


- Let AB and CD be segments in the complex plane, and let a, b, c, and d be 
complex numbers corresponding to the endpoints of these segments. We have 
AB = CD if and only if |b — a| = |d - cl. 


Next, we show that if ABCD is a parallelogram, then the diagonals bisect each 
other. We can take a bit of a shortcut here by letting A be the origin, which means 
the midpoint of AC is simply §. Then, because AB || DC and AB = DC, the 
translation that maps B to A also maps C to D. This translation maps B to the 
origin, so it maps any complex number z to z — b. Because this translation also 
maps C to D, we have d = c — b. Therefore, we have d + b = c, so the midpoint of 
BD is 2d = §. This tells us that diagonals AC and BD have the same midpoint, 
which means they bisect each other. 0 


Having tackled parallel lines, let’s turn to perpendicular lines. 


‘Problem 8.13: Let AB and CD be segments in the complex plane, and let a, b, c, and d be complex numbers § 
corresponding to the endpoints of these segments. Find a condition similar to that in Problem 8.11 that is = 


necessary and sufficient to show that AB 1 CD. 


Solution for Problem 8.13: Solution 1: Use the method of previous problems. As before, we 
start with translations to the origin. We translate AB such that the image of A is the 
origin. This translation maps B to a point P’ that corresponds to b’ = b — a. Similarly, 
we translate CD such that the image of C is the origin, which means the image of D is 
the point D’ that corresponds to d’ = d — c. 


Translating a segment does not change its orientation, so AB Il OB’ . Therefore, the 
angle ÖD’ makes with OB’ is the same as the angle OD! makes with AB. Similarly, OB’ 
makes the same angle with ED as it makes with OD’. Combining these observations, 
we see that the angle between 4B and ED is the same as that between OB’ and OD’. 
This means that AB 1 ED if and only if OB’ 1 OD’. Therefore, our problem now is 
to prove that OB’ L OD’ if and only if = is imaginary. Since b’ = b- a and d' =d-c, 
we must show that OB’ 1 OD’ if and only if b’/d’ is imaginary. 
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. lf OB L OD’, then the arguments of b’ and d’ differ by 5. Therefore, if d’ in exponential form is re? then b’ is 
seOH(/2)) or se(P(7/2)) where r and s are real numbers. If b’ = se“9+("/2)), then we have 


SOON cee 
d' reið r po 


Similarly, if b’ = se'®-07/2), then we find Ẹ = —ŝi. In either case, $ is real, so Ẹ is an imaginary number. 


We must also show that if 5 = ki for some real constant k, then OB’ L OD’. Multiplying both sides by d’ gives 
b’ = (kd’)i. Therefore, B’ is a 90° counterclockwise rotation about the origin of the point corresponding to kd’. The 


point corresponding to kd’ is a dilation of D’ about the origin, so it is on OD’. Since B’ is a 90° rotation about the 
origin of a point on OD’, we have OB’ 1 OD’, as desired. 


Solution 2: Use the result of Problem 8.11. We turn our problem about perpendicular lines into one about 
parallel lines by rotating one of the lines 90° about the origin. Let ED be the image of CD upon a 90° rotation 
counterclockwise about the origin. Therefore, we have c’ = ci and d’ = di. We have AB 1 CD if and only if 
AB || ED . From Problem 8.11, we have AB Il CD’ if and only if b-a is a real number. That is, AB || ED if and 


d’—c’ 
only if + = t for some real constant t. Substituting our expressions for c’ and d’, this equation becomes 
b-a 
di—ci 


Multiplying both sides by i gives 4 = ti, which is imaginary. Therefore, we have AB 1 CD if and only if i= is an 
imaginary number, as before. O 


Important: | Let AB and CD be segments in the complex plane, and let a, b, c, and d be 
complex numbers corresponding to the endpoints of these segments. Then 
AB 1 CDif and only if (b — a)/(d — c) is an imaginary number. 


| Problem 8.14: Let A, B, C, and Dbe four different points in the complex plane, and let a, b, c, and d be complex 
numbers corresponding to these points. 


| (a) Show that AB || GD if and only if (b - a)\(d-@ - b - a)(d - c) = 0. 
| (b) Show that AB 1 CD if and only if (b — a)(d - 0) + (6-a)(d-c) = 0. 


Solution for Problem 8.14: 


(a) In Problem 8.11, we showed that AB || CD if and only if (b — a)/(d —c) is real. Back on page 204, we showed 


that a complex number z is real if and only if Z = z, which we can write as z — Z = 0. So, because AB I CD if 
and only if (b — a)/(d — c) is real, we must have z — Z = 0 for z = (b — a)/(d — c): 


b-a b-a 
ao (8.1) 
Since we have M 
(=o b-a _ b-a 
doc d-c d-t 
we can write Equation (8.1) as $ 
b-a b-a LG 
d-c d-t ` 


Multiplying both sides by d — c and by d — 7, we have the desired 
(b —a)(d—7) —-(b-a)(d-c) =0. 
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(b) We can follow essentially the same steps as in the previous part. In Problem 8.13 we showed that 4B LED 
if and only if (b—a)/(d—c) is imaginary. Back on page 204, we learned that a complex number z is imaginary 
if and only if Z = —z, which we can rearrange as z + Z = 0. So, letting z = (b — a)/(d — c), we have 


which we can write as 


Multiplying both sides by (d — c)(d — ©) gives 
(b—a)(d —2) + (b-—ay(d—c) =0. 


O 


Our work in Problem 8.14 allows us to write the conditions “(b—a)/(d—c) isa realnumber” and “(b—a)/(d—c) is 
an imaginary number” as equations involving only a, b, c, and d. These equations are often much more convenient 
to work with than the qualitative description of (b — a)/(d — c) being real or imaginary. 


Important: — 


Y 


Let A, B, C, and D be four different points in the complex plane, and let a, b, c, 
and d be complex numbers corresponding to these points. 


e We have AB || ED if and only if (b — a)(d - ©) — E —a)(d - c) = 0. 


e We have AB 1 CD if and only if œ- a)(d -9 + -AA -9=0. 


These aren’t really new observations; they are equations that follow from the conditions we found earlier 
under which AB and CD are either parallel or perpendicular. 


Looking back at our solution to Problem 8.14, we actually proved something a little bit stronger than we were 
required to. We needed A and B to be different to define line AB, and needed C and D be different to define EB. 
But nowhere in our proof do we use the fact that A and B differ from C and D! We could have had B and D be 
the same point, and every step in the proof holds. Therefore, by letting B and D be the same point, we have a 
condition for A, B, and C to be collinear: 


Let A, B, and C be three different points in the complex plane, and let a, b, and 
c be complex numbers corresponding to these points. A, B, and C are collinear 
if and only if 


FRIESE 


-G-9 - G-Hb- 0) =0._ 


The condition for perpendicularity given above also holds if we let B and D be the same point. 


Concept: ~ Sometimes we can prove a statement by proving a more general statement. 


Problem 8.15: Suppose VWXYZ is a pentagon such that the altitudes from V, W, X, and Y to the respective 
opposite sides of the pentagon (V to XY, W to YZ, etc.) meet ata common point P. Show that the altitude from 
Z to WX passes through P as well. 
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Solution for Problem 8.15: We are given four pairs of perpendicular lines and must use them to prove a fifth pair 
of lines are perpendicular. One point is common to all the pairs of lines: P. So, we'll let P be the origin, expecting 
that to make our algebra simpler. We let v, w, x, y, z be the complex numbers corresponding to the respective 
vertices of the pentagon. Since VP iL XY , we have 
(v — O)(X — y) + ©- O)(x— y) = 0. 
We simplify v — 0 and 7 — 0, and then write similar equations for each of the other three altitudes, and we have 
U(X — y) + W(x — y) =0, 
wl -2) + Wy —z) =0, 
x(Z — 0) + X(z— v) =0, 
y0 - wW) + yv - w) = 0. 
We’d like to prove that ZÈ L WX, which is true if and only if 
z(w — X) + Z(w — x) = 0. (8.2) 
Looking at our initial four equations above, we see that all the terms in the expansion of Equation (8.2) appear in 
the expansions of the first four equations. Unfortunately, there are a bunch more terms in those four equations. 
Let’s go ahead and expand them and get a closer look at those extra terms: 
vX — vy + Ux —vy = 0, 
Wy — WZ + Wy — wz = 0, 
xZ — XU + Xz —- Xv = 0, 
yo — yw + yv — yw = 0. 
If we add all of these equations, the extra terms will all cancel! For example, the vx in the first equation cancels 
with the —xv in the third equation, and the —vy in the first equation cancels with the yv in the fourth equation. 
The terms that don’t cancel when we add all four equations leave us with 
= U2 ee — We = A 
Factoring, multiplying by —1, and doing a little rearranging, gives us 
z(W —X)+2(w - x) =0, 
so ZÈ il WX, as desired. O 
We glossed over an important strategy in setting up our solution to Problem 8.15. We wrote the four equations 
corresponding to the four given altitudes in a way that made it clear that we would have a lot of cancellation 
when we added the four equations. For example, if instead of writing the four equations 
oF 7) + B(x — y) =0, 
w(y — Z) + wy — z) = 0, 
x(Z — 3B) + xX(z — v) = 0, 
y0 - wW) + yv — w) = 0, 
we had written the four equations 
u(x — y) + U(x — y) =0, 
wy — Z) + Wy — z) = 0, 
x(Z — v) + x(z - v) =0, 
yD — 0) + ¥w-v) =0, 
then adding the four equations won’t cancel the v} and yọ terms. We'd have to subtract the last equation instead, 


and our key step would be “add the first three equations and subtract the last,” which is a much harder key step 
to see. (Look at the last equation in each set to see the difference between the two systems.) 
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We make sure our equations are set up in a convenient way by building them ina consistent x Ww 
manner. Here, we can set our equations up with a nice symmetry by visualizing the pentagon, 
and making sure the three points in each equation appear in the same order that they appear v 
when we travel counterclockwise around the pentagon. For example, we go counterclockwise 
when we go from V to X to Y around the pentagon, and v, x, and y appear in that order in the 
first equation. Similarly, we go counterclockwise to go from W to Y to Z, so w, y, z appearin Y 
that order in the second equation, and so on. 


Maintaining symmetry in systems of equations helps us find effective ways to 
work with the equations. 


| Concept: 


Exercises a 


8.2.1 Which of the following complex numbers lies on the line through 1 + 3i and 5 — 27 in the complex plane: 
go 7 2 5i. 


8.2.2 Show that the equation whose graph consists of all complex numbers z on the line through a and b in the 
complex plane is 


8.2.3 Prove that the quadrilateral formed by connecting the midpoints of any quadrilateral is a parallelogram. 


8.2.4 In Problem 8.13, we used an algebraic approach to show that if O is the origin and points B’ and D’ in 
the complex plane satisfy OB’ L OD’, then z is an imaginary number, where b’ and d’ are the complex numbers 
corresponding to B’ and D’, respectively. Prove this fact using geometric transformations. That is, find a sequence 
of transformations that maps B’ to D’, and use this sequence to explain why 5 must be imaginary. 


8.2.5 A median of a triangle is the segment from a vertex of the triangle to the midpoint of the opposite side. In 
this problem, we show that the medians of a triangle are concurrent at a point called the centroid, and show that 
the distance from a vertex of the triangle to the triangle’s centroid is 2/3 the length of the median from that vertex. 


(a) Let a, b, and c be complex numbers corresponding to the respective vertices of AABC. Show that if 
g = (a+b + c)/3 corresponds to point G, then point G is on all three medians. 


(b) Let M be the midpoint of BC. Show that AG/GM = 2. 
8.2.6 Show that the complex numbers a, b, and c are collinear if and only if 
ab + bē + ca = āb + be + ĉa. 
8.2.7 In convex quadrilateral WXYZ, the midpoints of WX, WY, and XZ are collinear. Show that the line through 
these points also passes through the midpoint of YZ. 


8.2.8x Let A correspond to 1 — 5i, B correspond to 3 + 2i, C correspond to SN Orr 7i, and D correspond to 
—2 — 2i V3. Find the acute angle between 4B and CD. 


8.3 Distance 


We introduced the magnitude of a complex number as the distance between that complex number and the origin 
in the complex plane. From this definition, we also saw that if w and z are complex numbers, then |w — z| is the 
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distance between w and z in the complex plane. In this section, we use the magnitude of complex numbers to 
solve problems involving lengths of segments. 


a > 


Problem 8.16: Let complex numters w and z correspond to points W and Z in the complex plane. Show that 


WZ? = (w — z)(W - 2). 


Problem 8.17: Prove that AB 1 ED if and only if AC? + BD? = AD? + BC?. 


Problem 8.18: Let c be a complex number and t be a real number such that t < cé. In this problem, we show that 
the graph of an equation of the form 
2E = EGE | = 
is a circle. 
(a) Find an equation whose graph is a circle with center c and radius r. 


(b) Show that the equation you found in part (a) is equivalent to zZ — cz — ¢z + t = 0 for some real number t. 
Why must we have the restriction t < ct? 


Problem 8.19: In this problem, we find the figure that consists of all points Z such that AZ = k - BZ, where A 
and B are points and k is a positive real number with k + 1. 


(a) Let A be the origin and B be 1 in the complex plane. Find an equation whose graph consists of the points 
Z such that AZ = k- BZ. 


(b) Show that the desired figure is a circle. 


Problem 8.20: The Triangle Inequality is a fancy way to say that the shortest distance between two points is a 
straight line. Specifically, the Triangle Inequality tells us that for any three points A, B, and C, we have 


AB + BC > AC, 


where equality holds if and only if B is on AC. 
(a) Let wand z be complex numbers. Use the Triangle Inequality to prove that j| + |z| 2 {w + z|. Under what 
conditions do we have |w| + |z| = |w + z|? 
(b) Let z1,22,...,2, be complex numbers. Prove that |z1| + |z2| + +++ + [2a] 2 |z1 + Z2 +--+ + Zyl. Under what 
conditions do we have |z;| + |Z2] + -< + |Z,| = Za + Z2 +--+ + Zahi? 


(c) Show that for any real numbers a, b, c, and d, we have Va? +b2+ Ve +d? > q/(a +c) + (b+). 


Problem 8.21: Find the smallest possible value of |z — 5| + |z + 4 — 3il. 


WZ? = (w — z)(@ —2). 


Solution for Problem 8.16: We have WZ = |w — z|, SO 
WZ? = |w —z)? = (w-2)(w—2z) = (w-z)\@-2), 


as desired. O 
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Important: If complex numbers w and z correspond to points W and Z in the complex. 


Y plane, then WZ? = (w — z)(W — 3). 


Problem 8.17: Prove that AB 1 CD if and only if AC? + BD? = AD? + BC?. 


Solution for Problem 8.17: We think to use complex numbers in this problem because we have a straightforward 
expression for the square of the distance between two points in the complex plane, and we have a somewhat 
similar simple condition for perpendicularity. We let a, b, c, and d be complex numbers corresponding to points 
A, B, C, and D in the complex plane. We then have 

AC? + BD? = (a -c)(@-7) + (b — d)(b — d) = a — ac — cā + cE + bb — bd — db + dd, 

AD? + BC? = (a -d)(@- d) + (b — c)(b — ©) = aa — ad — da + dd + bb — be - ch + c. 


We see a lot of common terms in these, so subtracting will give us some cancellation: 


(AC? + BD?) — (AD? + BC?) = (-at — ca — bd — db) + (ad + da + be + cb) 
= a(d — ©) + b(¢ — d) + c(b - a) + d(ā — b) 
= a(d - ©) — b(d - ©) - c(a — b) + d(a — b) 
= (a — bì\(d - ©) + (d — c)(@ — b). 


So, we now have AC? + BD? = AD? + BC? if and only if (a — b)(d — ©) + (d — c)(ā — b) = 0. But this is exactly 
the condition for perpendicularity that we proved on page 270. Since AC? + BD? = AD? + BC? if and only 
if (a — b)\(d — 0) + (d —c)(@— b) = 0, and (a — b)(d - 7) + (d-c)(@— B) = 0 if and only if AB 1 CB, we have 
AC? + BD? = AD? + BC? if and only if AC? + BD? = AD? + BC. 0 


So far, we've discussed points, lines, and distance. Since a circle is defined in terms of a point and a distance, 
we can describe a circle in the complex plane as the graph of an equation. 


Problem 8.18: Let c be a complex number and t be a real number such that t < cc. Show that the graph of an 
equation of the form 


zz —cz—cz+t=0 


is a circle. 


Solution for Problem 8.18: If point z is on the circle with center c and radius r, then |z — c| = r. Since both sides of 
|z - c| = rare positive, the graph of |z — c| = r is the same as the graph of |z — c|? = r°. We prefer this latter form 
because it allows us to get rid of the magnitude symbol, since |z — c|? = (z—c)(z— c) = (z - c)(Z- 72). So, the equation 
|z — cl? = 7? becomes 


(z-c)(z-=Pr. 
Expanding the left side and subtracting r* from both sides gives 
zz —tz—- z+ - 7 =0. 
Since c¢ must be real and 7° is real, letting t = cc — r? gives us 
zz —Cz—-cz+t=0, 


where t is the real number cc — r°. 
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Tn ue te 


To see that the graph of any such equation is a circle if £ < c¢, we need only walk backwards through our steps 
above. We write t as cé + t — cc, so we have 


ZZ —Cz—czZz+cc+t—-—cc=0. 


Factoring the first four terms and moving the other terms to the other side of the equation gives (z—c)(Z—¢) = c —t, 
so (z —c)(z — c) = ce — t. Therefore, we have 


lz -c = c-t. 


The left side must be nonnegative. Now we see why we must have the condition t < cc. We can write c¢ — t = 7° 
for some positive r, and we have |z — c}? = r*. Taking the square root gives |z — c| = r, and we know the graph of 
this equation is a circle with center c and radius r. O 


The equation we just found for a circle in the complex plane is rather unwieldy, so we usually stick to the tools 
of Euclidean geometry in problems involving circles. However, applications of complex numbers to circles can be 
very effective, as we'll see in the next problem. 


Problem 8.19: What figure consists of all points Z such that AZ = k- BZ, where A and B are points and kis a 


positive real number with k # 1? 


Solution for Problem 8.19: We let a, b, and z correspond to A, B, and Z, respectively, so that AZ = k - BZ becomes 
lz — a| = klz — b|. Before we square this equation to get rid of the magnitudes (which we can safely do because 
both sides are positive), we make life a bit easier on ourselves by letting a = 0 and b = 1, because this makes our 
equation |z| = k|z— 1|. We can make this simplification because no matter where A and B are, we can draw the axes 
such that A is the origin and B corresponds to 1 on the real axis. 


Squaring both sides of |z| = k\z — 1| gives |z? = klz — 1, from which we have 
zz =e(z-1)2-1 =R(z- DZ- 1) = Rh @z-z-Zz+1). 


Rearranging gives 


(2 = 1)27 — Iz —7 +i =0. 
Dividing by k? — 1 gives 
aoe ei 
If we letc = t= =, then we can write this equation in the form that we found in Problem 8.18: 
za =z = l = A, 


(Note that c = ¢ because c is real.) We must only confirm that cc — t > 0 to deduce that the graph is a circle. Since 
t = c =T, we have 


2 k? 2 k 1 k? 
c-t=c?-c=c(c-1)= (aS) (=a!) 2 (a) gale -1 


which is clearly positive, so the graph is a circle. Since c = Aa) the center of this circle is the point corresponding 


to AS on the real axis. And since cc — t = ane the radius of the circle is a: o 


The circle we discovered in Problem 8.19 is called a circle of Apollonius, which is a circle that is defined as the 
set of points Z such that AZ = k - BZ for some given points A and B and some positive constant k # 1. 


We'll finish our discussion of distance in the complex plane with the Triangle Inequality, which is a fancy way 
to say that the shortest distance between two points is a straight line. 
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Important: The Triangle Inequality states that for any three points A, B, and C, we have 
y AB + BC 2 AC, 


where equality holds if and only if B is on AC. 


| Problem 8.20: 
(a) Let w and z be complex numbers. Show that |w| + |z| > |w + z|. Under what conditions do we have 
|w| + Iz] = |w + z|? 


b) Let 21,22,...,Z, be complex numbers. Prove that |z,| + [zo] + ++ + Izal = |z1 + 22 +++- +2Zn|. Under what 
P 
conditions do we have |z| + |Z2| +--+ + |Z,| = [z1 + Z2 +--+ Znal? 


(c) Show that for any real numbers a, b, c, and d, we have Va? +b? + Vc? + d* > 


(a +c)? + (b +d}. 


Solution for Problem 8.20: 


(a) First, we note that the inequality obviously holds (and equality holds) if w or 
zis 0. To address the case in which both are nonzero, we turn to the complex 
plane. Let points W, Z, and S correspond to complex numbers w, z, and w Lz, 
respectively, and let O be the origin. As we have seen several times before, OWSZ 
is a parallelogram, so WS = OZ = |z|. Applying the Triangle Inequality to AOWS 
gives us OW + WS = OS. We have OW = |w|, WS = |z| and OS = |w + z|, so we 
have |w| + |z| > |w + z|, as desired. 


We have equality in the Triangle Inequality OW + WS = OS if and only if W is 
on OS, so we have equality if and only if w is on the segment connecting the origin to w + z in the complex 
plane. Since w is nonzero, this means w + z must be nonzero and that w = c(w + z) for some real constant 


c with 0 < c < 1. Solving for z, we have z = 2 - w = (1-1) w. Since 0 < c < 1, we have 1-1 > 0, so 


z= € = 1) w tells us that z is a dilation of w about the origin with nonnegative scale factor. This means 
that w and c are on the same ray from the origin. 


(b) We can extend the first part to the sum of the magnitudes of any number of complex numbers. Repeatedly 
applying part (a), we have 

zal + al + zal + ++- + lanl 2 la + z2l + eal + +-+ + Len (8.3) 

2 |Z + 22 + 23| + [zal +--+ + [Zal (8.4) 


> : 
> |Z. + 22 $23 +- + Znal + [Zn 
> |En 1 Za + ee Zyl 


To see when equality holds, we note that we must have equality in each step above in order to have 
Izal + lal + l2a] + ++: + [Zn] = [2a + z2 +--+ + zyl. 


In our first step on line (8.3), we use the result of part (a) to get |z1| + |z2| > |z1 + z2|, where equality holds if 
and only if zı and z2 are on the same ray through the origin. lf zı and z2 are on the same ray through the 
origin, then z; + z2 is on this ray as well. In our next step on line (8.4), we again apply the result of part 
(a) to get [zı + Z2| + |z3| = |z1 + Z2 + Z3|, where equality holds if and only if z; + z2 and z3 are on the same 
ray through the origin. The ray from the origin through z1 + z2 is also the ray through z; and z3 (if equality 
holds in our first application of the Triangle Inequality), so we have |z;| + |Z2| + |z3| > |z1 + z2 + z| if and only 
if Z1, Z2, and z3 are on the same ray from the origin. Continuing in this way, we see that we have 


al lz2l + 2g Zyl = 2 ee | 
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if and only if z1, Z2, Z3, . . . , Zn are on the same ray from the origin. 


(c) We can use magnitude to relate the square root of a sum of squares to a complex number: |a + bil = Va? + b2. 
Similarly, we have |c + di| = Vc? + d?, so from the Triangle Inequality expression 


la + bil + |c + dil > |(a + bi) + (c + dì], 


we have 
Va? +b? + Vc? +d > \(a+c) + (b+ dil = /(a +c) + (b+ d}. 
E 
Important: For any complex numbers te Z3, Ree we have oa fi 
y (zal + lza] + ++ + |Z] 2 |Z1 + Spt + Zal, 
where equality holds if and only if z1, 22, Z3, . - . , Zn are on the same ray from the 
| origin. 


The version with two terms, |z;|+ |Z2| = |z1 + Z|, is often referred to as the Triangle Inequality. In Problem 8.20, 
we proved this algebraic expression of the Triangle Inequality by using the geometric expression of it. You 
might wonder if it’s possible to go the other way—can we use the algebra of complex numbers to deduce that 
IZ1|+ |z2| = |z1 + Z2|, and then use this to produce the geometric interpretation of the Triangle Inequality? Reversing 
our solution to part (a) above takes us from |zj| + |z2| = |z1 + Z2| to the geometric interpretation of the Triangle 
Inequality, but proving that |z,| + |z2| 2 |z1 + z2| with only algebraic tools is a bit trickier! You'll have a chance to 
show how to do it as an Exercise. 


| Problem 8.21: Find the smallest possible value of |z — 5| + |z +4 — 3 i]. 


Solution for Problem 8.21: Solution 1: Algebra. Applying the Triangle Inequality gives 
lz —5| + |z + 4 - 3il| > |(z — 5) + (z +4 - 3i)| = [22-1 - 3i]. 
This isn’t much help, since we can’t do much with |2z — 1 — 3i]. 
We'd like to use the Triangle Inequality in a way that eliminates z. We could do so if the z term were negative 
in one of terms. We can make that happen! Since |w] = |-w| for any w, we have 


Iz — 5| + |z +4- 3i] = |z - 5| + |-(z + 4 — 3) = Iz - 5| + |-z —4 + 3] > |(z - 5) + (-z -4 +30) = |-9 + 3il = 3 VIO. 


We still have to confirm that this value is possible. We do so by noting that when z = 5, we have |z- 5|+ |z +4- 3i| = 
0 + |9 — 3i] = 3 V10. 


Finding the minimum value of an expression is a two-step process. We must 
show both that the minimum value is achievable, and that no smaller value 
is possible. (Similarly, finding the maximum value of an expression is also a 


two-step process.) 


The key step here was thinking about what we’d like to have happen: 


_ A little wishful thinking can go a long way. p ia e 4 


Concept: 


277 


CHAPTER 8. GEOMETRY OF COMPLEX NUMBERS 


Solution 2: Geometry. The expression |z — 5] is the distance from z to 5 in the complex plane. The expression 
lz + 4 — 3i] equals z — (—4 + 3i), which is the distance from z to —4 + 3i in the complex plane. By the Triangle 
Inequality, the sum of the distances from z to 5 and to —4 + 3i in the complex plane is at least the distance from 5 
to —4 + 3i. The distance between these two points is |5 — (—4 + 32)| = |9 — 37] = 3 V10. As before, we must show that 
this value is achievable. From the Triangle Inequality, we know that we achieve this value if z is any point on the 
segment from 5 to —4 + 3i. We can obviously take z to be either endpoint, and we'll get 3 V10 as the value of the 
given expression. O 


Our second solution reinforces an important strategy when working with complex numbers: 


~ Some complex number problems that don’t appear to be about geometry can be 
solved quickly by thinking about the problem geometrically. 


Concept: 


) Exercises > 
8.3.1 What is the radius of the circle that is the graph of the equation zz + (—2 + i)z + (-2-1)z = 11? 
8.3.2 Show that if P is a point on altitude AX of AABC, then AC? — CP? = AB? — BP”. 


(a) Prove that |x + y -z| + |x — y + z| > 2ļxl. 
(b) Prove that |x| + |y| + |z| < |x + y = z| + lx- y + z| +|-x + y + zl. 


8.3.4 Show that if w is a complex number and a and b are nonzero real numbers such that |w| = |w +a] = jw+bil = 1, 
then g? + b? = 4. 


8.3.5 Show that the sum of the lengths of the diagonals of a quadrilateral is less than the perimeter of the 
quadrilateral. 


8.3.6x In the text, we used the geometric version of the Triangle Inequality to prove a version of the Triangle 
Inequality for complex numbers, |w] + |z| = |w + z|. Prove this inequality using only the algebraic properties of 
complex numbers, without using geometry. Hints: 93, 288, 87 


8.4 Regular Polygons 


We've already seen that complex numbers can be related to regular polygons when we discovered that plotting 
the n™ roots of unity on the complex plane gives us the vertices of a regular n-gon. In this section, we explore 
geometry problems involving regular polygons that can be tackled with complex numbers. 


Problems > 


Problem 8.22: Suppose w and z are the endpoints of a diagonal of a square in the complex plane. Find the other 
two vertices of the square in terms of w and z. 


Problem 8.23: The points (0,0), (a, 11), and (b,37) are the vertices of an equilateral triangle. Find ab. (Source: 
AIME) Hints: 80 


Problem 8.24: Let ABC be a triangle in the complex plane whose vertices correspond to the complex numbers 
a, b, and c. Prove that AABC is equilateral if and only if a — b = C(c —b), where Ç is a primitive sixth root of unity. 
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Problem 8.25: Suppose AP; P,P3 and AQ1Q2Q; are equilateral triangles with the same orientation. (By same 
orientation, we mean that the vertices are labeled in the same order, either clockwise or counterclockwise, in 
each triangle.) Let Mı, Mz, and M3 be the midpoints of P; Qi, P2Q2, and P3Q3, respectively. Show that AM,;M2M3 
is an equilateral triangle. 


Problem 8.26: Points X and Y are outside AABC such that AXAB and AYBC are equilateral and do not overlap 


AABC. Let P be the midpoint of XB, Q be the midpoint of YB, and R be the midpoint of AC. Prove that APQR 
is equilateral. 


Problem 8.27: Let n > 3 be a positive integer, and let sella pez be a regular n-gon inscribed in a circle with 
radius 1. Compute PoP; - PoP2--- PoPn-1. 


We'll start with a warm-up by considering a regular polygon that’s particularly easy to work with: a square. 


| Problem 8.22: Suppose w and z are the endpoints of a diagonal of a square in the complex plane. Find the 
| other two vertices of the square in terms of w and z, 


Solution for Problem 8.22: Let complex numbers w and z correspond to points W and Z in the Ww 


complex plane, and let A and B be the other vertices of the square, as shown at right. While it’s A 
not immediately obvious how to find A and B from W and Z, if we could locate the center of 


the square, we could quickly find A and B by rotating W and Z by 90° about this center. But we 
know how to find the center—it’s the midpoint of diagonal WZ. B 


Letting this midpoint be P, with corresponding complex number p, we have p = 7#. Since 
AP = WP = BP = ZP and AB 1 WZ, points A and B are 90° rotations of W and Z about P. The complex number 
corresponding to a 90° counterclockwise rotation of Z about P is 


_ ypni/2 . WtZ ee ee (5- =) (5+ =) 
pepe aoe + (z SA es 775 wt 2'3 Z; 


and the complex number corresponding to a 90° counterclockwise rotation of W about P is 


D ee ey = (5+ =) (5-5) 
p+ (w— pe? = P22 + (w- i= (54+5)w+(5-5)z 


We can let either of these correspond to A, and the other corresponds to B. O 


| Problem 8.23: The points (0,0), 0), (a 11), 
| AIME) 


and (b, 37) are the vertices of an equilateral triangle. Find ab. (Source: 


Solution for Problem 8.23: Let O be the origin, A be (a,11), and B be (b,37). We could start out with the distance 
formula. Since OA = AB = BO, we have 


Va2 +112 = \/(a—b)? + (11 - 37)? = Vb? + 372. 


Yuck. We can relate a2 and b? with Va2 + 112 = Vb? + 372, but the expression „(a — b)? + (11 — 37)? stops us cold. 
Substituting for a or b looks pretty ugly. 


Concept: | When your initial approach to a problem hits a seemingly unsurpassable obstacle, 
O= take a step back and look for another way to view the problem that allows you to 
avoid the obstacle. _ | 
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Using the lengths of the sides of the triangle didn’t help, so let’s consider the angles of the triangle. All three 
angles are 60 degrees, which might suggest the Law of Cosines. Unfortunately, that brings us back to the lengths 
of the sides, which gets us right back to the nasty expressions above. 


Since the lengths of the sides are difficult to deal with in this problem, we’d like to find a useful fact about 
equilateral triangles that doesn’t require that we use the actual lengths of the sides. Because AOAB is equilateral, 
point B is the image of rotating A by Ẹ about the origin. Notice that we are not using the actual lengths of the sides 
here—we are only using the fact that the side lengths are the same. This is promising, so we investigate further. 
We have just learned a tool for investigating rotations about the origin—complex numbers. 


To use complex numbers, we view the problem in the complex plane, so A is now the location of a+11zin the 
complex plane, and B corresponds to b + 37i. Because rotating A by 4 degrees about the origin gives us point B, 
we must have 

(a+1lie™ =b+37i or (ate =b + 37i. 


We'll work through the details of the first case, (a + 11i)e™/? = b + 37i. Expanding the left side gives 


i 11V3 1, 3p 
(a + ije™? = (a + 11i) G t X) “of = Ta 


2 ae 


so we have 


a_liv3, (11, av3 
Be 2 Dy” 2 


) i=vs ori 


Equating the real parts and equating the imaginary parts of both sides gives 


TR 
2 2 
11 av3 
Sages OM 57, 
2* 2 


The second equation gives us a = 21 V3, and substituting this into the first equation gives b = 5 V3, so ab = 315. 


If we work through the details of the case (a + 1lije“™/3 = b + 37i in the same manner as above, we'll find 


a = —21 V3 and b = —5 V3, so ab = 315 in this case, as well. This isn’t a surprise: the equilateral triangle we 
produce in this case is the reflection over the imaginary axis of the one we found in the first case. O 


In this problem, we combined our understanding of exponential and rectangular forms of complex numbers 
to tackle an analytic geometry problem that didn’t appear to have anything to do with complex numbers. One of 
the clues that complex numbers might be helpful in this problem is: 


‘Concept: Complex numbers can often be helpful in problems involving rotations, since 
rotations are easy to express with complex numbers. 


Problem 8.24: Let ABC bea triangle in the complex plane whose vertices correspond to the complex numbers 
a, b, and c. Prove that AABC is equilateral if and only if a — b = C(c — b), where C is a primitive sixth root of 


unity. 


Solution for Problem 8.24: The two primitive sixth roots of unity are e™/> and e°"/3, so we must show that AABC is 
equilateral if and only if a — b = e™/3(c — b) ora — b = &™/9(c — b). These equations look exactly like equations we 
encountered when discussing rotations. The equation a — b = e™/3(c — b) tells us that A is the image of C upon a 
3 radian counterclockwise rotation about B. That is, we have CB = BA and ZABC = 60°, from which we deduce 
that AABC is equilateral. Similarly, the equation a — b = e°"/9(c — b) means A is the image of C upon a © radian 
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counterclockwise rotation about B. Such a rotation is the same as a 27 — 5E = 3 rotation clockwise, from which we 
again conclude that CB = BA and ZABC = 60°, so AABC is equilateral. 


Conversely, if AABC is equilateral, then a 60° rotation about B maps C to A, because BC = BA and ZABC = 60°. 
If this rotation is counterclockwise, then we have a — b = e™/3(c — b), and if it is clockwise, then a — b = e&”/3(c — b). 


Combining the previous two paragraphs, we conclude that AABC is equilateral if and only if a — b = C(c — b), 
where Ç is a primitive sixth root of unity. O 


Important: Let AABC be a triangle in the complex plane whose vertices correspond to | 
Vv the complex numbers a, b, and c. Triangle ABC is equilateral if and only if 
a—b=C(c—b), where C is a primitive sixth root of unity. 


Let’s put our new tool for equilateral triangles to work. 


Problem 8.25: Suppose AP; P2P3 and AQ)Q2Q3 are equilateral triangles with the same orientation. (By same 
orientation, we mean that the vertices are labeled in the same order, either clockwise or counterclockwise, 
in each triangle.) Let Mı, Mz, and M3 be the midpoints of P1Qi, PQ, and P3Q3, respectively. Show that 
AM,M2Ms3 is an equilateral triangle. 


Solution for Problem 8.25: It’s not clear where to start using the tools of Euclidean geometry, but with complex 
numbers, we don’t even need a diagram to solve the problem quickly. We have a corresponding complex number 
Pi, Ji, or m; for each of the vertices of the the three triangles, so that m; = (p; + q;)/2 for i = 1,2,3, because M; is the 
midpoint of P;Q;. 


Since AP) P2P3 is equilateral, we have 
p2— Pi = C(ps — pı) 
for some primitive sixth root of unity Ç. Similarly, equilateral AQ;Q2Q3 gives us 
q2 — 91 = C(qs — 91). 


We know that the primitive sixth roots of unity in these two equations are the same because the triangles have the 
same orientation. If we didn’t know that the two triangles have the same orientation, we couldn’t take the next 
step, which is to add the two equations to produce expressions of the form p; + qi: 


(p2 + q2) — (pı +91) =F ((p3 + 93) — (Pı + 91)) - 


Since p; +q; = 2m; for i = 1,2,3, we can write the equation above as 2m — 2m, = C(2m3 — 2m). Dividing by 2 gives 
us m — mı = C(m3 — mı), so AMıMM; is equilateral. 0 


Problem 8.26: Points X and Y are outside AABC such that AXAB and AYBC are equilateral and do not overlap 
AABC. Let P be the midpoint of XB, Q be the midpoint of YB, and R be the midpoint of AC. Prove that APQR 


is equilateral. 


Solution for Problem 8.26: We know how to handle equilateral triangles and midpoints with complex numbers. As 
usual, we use the corresponding lowercase letter to represent the complex number that corresponds to each point. 
We'll present two solutions. 


Solution 1: Work forwards. Our goal in this approach is to express p, q, and r in terms of a, b, and c, and then use 
these expressions to show that APQR is equilateral. Since P and Q are the midpoints of XB and YB, we'll have to 
find x and y in terms of a, b, and c first. We do so with equilateral triangles XAB and YBC. We focus on rotation 
about point B, the vertex the two triangles have in common. What’s wrong with this start: 
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Bogus Solution: We have x-b = C(a—b) and y—b = C(c—b), where C is a primitive sixth root 


of unity. If we subtract the second equation from the first, we eliminate b. 


We see the problem with this start when we look at the diagram at right. The X 
equation x — b = C(a — b) comes from the fact that X is the image of A upon a 60° Q 
rotation counterclockwise about B, but the equation y — b = C(c — b) comes from the SS 
fact that Y is the image of C upon a 60° rotation clockwise about B. The orientations V7 
of these two rotations are different, so we cannot use the same value of C for each es 


rotation. We therefore cannot simply subtract the second equation from the first to 
eliminate b. A R 


The orientations of rotations matter in complex numbers, just as they matter 
in geometry. 


To fix this problem, we write equations using rotations of the same orientation, X 
as marked in the diagram, so that we can use the same primitive sixth root of unity: 


p 
Q 
a—b=C(x—b) RY Y 
y =b = e= b) ae d 
A R G 


where C is a primitive sixth root of unity. Note that we cannot be sure which primitive 

sixth root of unity C is. In the diagram above, we would have C = e™/3 but if the 

vertices of AABC were labeled in the opposite order, we'd have C = e&/°. By simply writing C, we cover both 
cases, as long as we use the same orientation of rotation about B to produce the equations above. 


WARNING!! 
rx 


a 


Before continuing with finding p, q, and r, we see that we can simplify our equations for x and y a great deal 
by letting b be the origin, which gives a = Cx and y = Cc. We therefore have 


y= (8.5) 
y= Cc. (8.6) 


Point B is also a natural choice for the origin because B is part of the definitions of both P and Q. 


‘Concept: Strategically choosing the origin when applying complex numbers to geometry 
O= can greatly simplify the algebra you'll have to deal with. 


Next, we find p, q, and r from the fact that P, Q, and R are the midpoints of XB, YB, and AC: 


_*ž DL 
p= = 
Tun 
q = = 


X 

pii 

y 

2 , 

We have x = # from line (8.5) and y = Cc from line (8.6), so we can write p and q in terms of a and c: 


_ a 
are 
_& 
1575: 
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We have a bit of a puzzle when we turn to APQR. We can test whether PQR is 
equilateral by testing rotation about any of the three vertices. Suppose we choose 
point R. Which should we try to confirm to check if APQR is equilateral, p-r = C(q—r) 
or q -r =C(p—r)? We choose p — r = C(q — r) because the orientation of 60° rotation 
about R that would map Q to P is the same as the orientation of the 60° rotation 
about B that maps X to A. This latter rotation is the one that gave us C, and we 
want to be able to use the same primitive sixth root of unity to avoid confusion and 
simplify the algebra. We have: 


Jog mre (o a 
P i 2C 2 a 2C ee 
eo — 
ig page —S\22ee, 


Our expressions for p — r and C(q — r) don’t appear to be the same. We can only be sure they’re the same if the 
coefficients for a match and the coefficients for c match. In other words, we must check if 


te = -> (equating the coefficients of a), 
Ee ' S 
Eo (equating the coefficients of c). 


These both simplify to the same equation: Ç? — Ç + 1 = 0. So, if we can show that C? — Ç + 1 = 0 for both primitive 
sixth roots of unity Ç, then we can conclude that our expressions for p — r and C(q — r) are the same. We could test 
that this equation is true for both primitive sixth roots Ç, but there’s a slicker way to confirm that CCEA 
First, we note that C is a sixth root of unity, so it is a solution to zf — 1 = 0. Factoring the left side as a difference of 
squares gives 

e =e +A 


The roots of z? — 1 are cube roots of unity, so they cannot be primitive sixth roots. Therefore, Ç is a root of z° + 1. 
Factoring this as a sum of cubes gives (z + 1)(z* — z + 1). Since —1 is a primitive square root of unity rather than a 
primitive sixth root of unity, we know that C is not a root of z + 1. So, we know that the two primitive sixth roots 
of unity are roots of z? — z + 1 = 0. This means we have (* — Ç + 1 = 0, as desired. Therefore, we have 1 — Ç = -? 
and Ç? — Ç = —1, and we can write our expressions for p — r and C(q — r) as follows: 


Therefore, we have p — r = C(q —r), as desired. 


One key step in this solution was noticing that two very different-looking expressions involving Ç are equiva- 


lent. 
Concept: | When working with roots of ‘unity, we will occasionally encounter two very 
O= different-looking expressions that we can show are equivalent. 


Looking back, there wasn’t any reason to know immediately that = = -§, but once we discovered that we 
needed this to be true to solve the problem, we focused on it and were able to prove it. What if we had taken this 
strategy for the whole problem—what if we focused first on what we wanted to be true? 
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Solution 2: Work backwards. In this solution, we start with expressions for p,q,and X 


rin terms of x, y, a, b, and c, and work backwards from an equation that implies that P Q 
APQR is equilateral. Our goal is to find an equivalent equation that we can prove is Dy, Y 
true. 
As in the first solution we f find expressions for p, q, and r using the fact that P, Q, Ta Van 
and R are the midpoints of XB, YB, and AC: A R G 
_ ZEO 
U’; 
_ytb 
ee) 
a+c 
f= P 


There are a variety of equations we can write in terms of p, q, and r that are true if and only if APQR is 
equilateral. We choose one involving p — q since this will eliminate b when we substitute the expressions above. 
Specifically, if we can show that p - q = C(r—q), where C is a primitive sixth root of unity, then we can deduce that 
APQR is equilateral. Substituting our expressions for p, q, and r, we wish to show that 


oo. o en 


We turn to equilateral triangles AXAB and AYBC in order to produce equations involving x, y, a, b, and c. In 
writing equations including C, we must make sure to use the same orientation of rotation that we used when 


writing p — q = C(r— 4). 


There are three different equations we could write for each triangle, one for rotation about each vertex. Rather 
than write them all down, we use our target Equation (8.7) as a guide. Since AB is a side of AXAB and YC is a 
side of AYBC, we can write the right side of Equation (8.7) in terms of complex numbers corresponding to sides 


of these two triangles: 
=y =( (E= 
2 2 l 

The a — b tells us to focus on the rotation about B in AXAB and the c — y tells us to focus on the rotation about Y in 
AYBC. These rotations (taking the same orientation we use to write p — q = C(r — q)) give us 

x—-b=Ca-—b), 

b- y= Uc- y). 
Aha! If we simply add these two equations, we get 

x= y a Ure 0). 


Dividing this equation by 2 gives us Equation (8.7), which we have shown is equivalent to p — q = C(r - q). 
Therefore, we can deduce that APQR is equilateral. O 


Our second solution is much simpler than the first. If we had been lucky in choosing the right equations for 
AXAB and AYBC in the first solution, we might have stumbled on the simpler approach we found in the second 
solution. The key to our second solution was working backwards to give us clues how to approach the problem 
in a way that would provide the desired result quickly. 


‘Concept: Working backwards from the desired result is a powerful problem solving strat- 


Os% ey 
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Equilateral triangles and squares are not the only regular polygons for which complex numbers can be helpful. 


Problem 8.27: Letn > 3 bea positive integer, and let PoP; --- P,-1 be a regular n-gon inscribed in a circle with 
radius 1. Compute Poet PoPa + Poe nt: 


Solution for Problem 8.27: If “circle with radius 1” isn’t enough of a hint to try complex numbers, “regular n-gon” 
sure is, because we have an easy way to represent a regular n-gon on the complex plane—with the n™ roots of 
unity. We let po,pi,...,Pn-1 be the n™ roots of unity, and let P, correspond to px for each root. Moreover, we let 
pk = eK!” for each k, which makes po = 1. This is particularly convenient because we can now express each length 
PoP, as |po — pkl = |1 — e?“*/"|. Therefore, the desired product is 


|1 = aa N |1 a oe r |1 = pee 5 |1 = ee : 


If we don’t recognize what to do right away, we can experiment. 


Concept: When a problem is given in terms of an integer n, experimenting with small values | 


Q= ofn can give clues to the solution. | 


When n = 2, the product is simply |1 — (—1)| = 2. When n = 3, we have |1 - | - |1 — w|, where w = e27#/9, 
Computing this product gives 


1-ol |1 - @?| = |(1 - w)(1 - w)| = l-0 -wto|. 


We can simplify this expression by noting that because w is a primitive cube root of unity, we have w =i = 0, 
Therefore, not only do we have w? = 1, but factoring gives us (w — 1)(w* +w +1) = 0. Since w + 1, we have 
w? +w +1 = 0, so w +w = —1. Therefore, we have 


|1 -a-w +a?|=1- (a +) +1) =3. 


Important: If w is a primitive cube root of unity, then w*+wt+1=0. 


Y 


So, when n = 2, the product is 2, and when n = 3, the product is 3. We now have a guess for what we'll get 


when n = 4. Let’s check: 
[a —a]-|1—(—1)| -1-C) = 21-90 + i) = 4. 


We now have a pretty strong suspicion that the product equals n for any positive integer n, but it’s still not so 
clear how to prove it. Let’s take another look at the product, and think about where else we have seen similar 
products: 

[c= une ie eamin]. j1 — EN... |i = et D2nifr | , 


Each term is the difference of 1 and an n® root of unity. We have seen a product of differences involving the n'h 
roots of unity before, in the factorization of the polynomial z” — 1. If f(z) = z” — 1, then we can write f(z) as 


f(z) =(z— 1) (z - ar) (z 5 ea) (z ~ eee) ve (z e: aa) l 


Except for the first term, z — 1, this looks almost exactly like our desired product. Fortunately, we can also factor 


o a OE 
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f(z) as (z-1)(z"1 +2"? +273 + --- +1), which allows us to divide by z — 1 to get 


(z = een) (z e ean) (z = E Io (z a = fo 


z= il 
M E A 
E a= il 


= Zh lg 2g tS e. 
We also could have jumped straight to 
(z = ae) : (z = h (z = el) en (z ví E) = tl a B a eee 


by noting that the roots of the polynomial on the right are all the n™ roots of unity except 1. Speaking of 1, we let 
z = 1 in this equation to find that 


|1 _ in |1 a geet) WES, la _ en Vania! = (a = a) ; (1 = ls) Wea (1 = Aes) 
| 
= (1 +1 41734-41141 
= ih, 


as expected. O 


Exercises p= 


8.4.1 Suppose PQRS is a square in the complex plane, and that p and q correspond to P and Q, respectively. Find 
the two possible complex numbers that correspond to R in terms of p and q. 


8.4.2 Let O be the center of a circle w. Points A, B, C, D, E, F on w are chosen such that the triangles OAB, OCD, 
OFF are equilateral and have the same orientation. Let L, M, N be the midpoints of BC, DE, FA, respectively. 
Prove that triangle LMN is equilateral. 


8.4.3 Let ABCD be a parallelogram. Suppose we construct equilateral triangles ABG and BCF externally on sides 
AB and BC, respectively. Show that triangle DFG is equilateral. 


8.4.4 Prove that each of the following are necessary and sufficient conditions for the triangle formed by plotting 
complex numbers a, b, and c in the complex plane to be an equilateral triangle. (That is, show that if a, b, c satisfy 
the condition, then the triangle is equilateral, and show that if the triangle is equilateral, then a, b, and c satisfy the 
given condition.) 

(a) a-b = œw(b-— c), where w is a primitive cube root of unity. Hints: 281 


(b) a+ wb +w? = 0, where w is a primitive cube root of unity. Hints: 14 


regular dodecagon. 


8.5* Classic Theorems 


In this section, we exhibit the power of complex numbers by proving four classic geometry theorems. Some of 
these proofs are quite difficult, so don’t feel bad if you can’t find them on your own. But, do work through and 
enjoy them! 
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Problems 


Problem 8.28: Ptolemy's Inequality: Prove that for any four points A, B, C, D ina plane, we have 


(AB)(CD) + (BC)(AD) = (AC)(BD). 


Problem 8.29: Outer Napoleon Triangle: Starting with APQR, we construct equilateral triangles AAPQ, ABQR, 
and ACRP such that these triangles do not overlap APQR. Prove that the triangle formed by connecting the 
centers of these three equilateral triangles is itself equilateral. Hints: 243 


Problem 8.30: Monge’s Theorem: Three circles are drawn in a plane with different 
radii such that no circle is inside another. The common external tangents to each 
pair of circles are drawn, as shown. The intersections of pairs of common external 
tangents are labeled Z1, Z2, and Z3, as shown. Show that Zi, Z2, and Zs are Zz 
: : 1 
collinear. Hints: 218, 284 


Problem 8.31: Simson Line: Let Z be a point on the circumcircle of AABC besides 
A, B, and C. Show that the feet of the perpendiculars from Z to the lines containing 
the sides of AABC are collinear. Hints: 120, 61 


We used a geometric explanation of the Triangle Inequality to develop an inequality relating complex numbers. 
Let’s try turning the tables, and use complex numbers to develop a geometric inequality. 


Problem 8.28: Ptolemy’s Inequality: Show that for any four points A, B, C, D ina plane, we have 


(AB)(CD) + (BC)(AD) = (AC)(BD). 


Solution for Problem 8.28: We let complex numbers a, b, c, d correspond to A, B, C, D, respectively, so that 
(AB)(CD) + (BC)(AD) = |b — al -|d—c| + |c —b|- |d — al. 


We have to prove an inequality, and the Triangle Inequality gives us a tool to produce inequalities regarding 
magnitudes of complex numbers. To apply |w| + |z| = |w + z|, we write 


|b — al - |d —c| + |c — b| - Id — a| = \(b — a)(d — c)| + l(c — b)(d — a)l. 
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Applying the Triangle Inequality gives 
I(b — a)(d —c)| + |(c — b)(d — a)| 2 (b — a)(d —c) + (c — b)(d — a)l. 
(Note that our expression for (AB)(CD) + (BC)(AD) is on the larger side of this inequality, which gives us some 


confidence that we’re moving in the correct direction.) We can see that some of the terms in the expression on the 
lesser side will cancel when we expand, so we give that a try: 
\(b — a)(d — c) + (c — b)(d — a)| = |bd — bc — ad + ac + cd — ac — bd + abl 

= |ab + cd — be -aad|. 
If we can’t see that this expression factors, then we can always get a hint from the desired inequality. We want to 
show that (AB)(CD) + (BC)(AD) is at least (AC)(BD), which equals |(c — a)(d — b)|. Expanding this gives us exactly 
the expression above, |ab + cd — bc — ad|, and now we have our proof: 

(AB)(CD) + (BC)(AD) = |b — al-|d — cl + |c — b| - |d — al 
= |(b — a)(d —c)| + |(c — b)(d — a)| 
> |(b —a)(d — c) + (c — b) (d — a)| 


= |ab + cd — bc — ad| 
= |(c — a)(d — b)| 
= (AC)(BD). 


As an extra challenge, see if you can discover the conditions under which equality holds in Ptolemy’s Inequality. 


Problem 8.29: Outer Napoleon Triangle: Starting with APQR, we construct triangles equilateral triangles 
AAPQ, ABQR, and ACRP such that these triangles do not overlap APQR. Prove that the triangle formed by f 
| connecting the centers of these three equilateral triangles is equilateral. 


Solution for Problem 8.29: As usual, we let the lowercase version of each point be 
the complex number corresponding to that point. We start by finding expressions 
for a, b, and c in terms of p, q, and r. From equilateral triangle APQ, we have 


A 
C 
P 
a—q = C(p — q), where Ç is a primitive sixth root of unity. Therefore, we have 
a = C(p— q) + q. Similarly, from triangles BQR and CRP, we have b = Cq - r) +r 
and c = Ç(r — p) + p. Q 
R 


Note that we are careful to use the same orientation for each rotation so that we 
can use the same primitive sixth root. For example, in AAPQ, we geta -q = C(p—q) 
from considering the 3 counterclockwise rotation of P about Q in the diagram 
at right, and we likewise consider counterclockwise rotations to produce our 


expressions for b and c. 


The medians of an equilateral triangle intersect at the center of the triangle. As B 
you should have showed in Exercise 8.2.5, the centroid of a triangle in the complex 
plane is the average of the vertices of the triangle. So, if we let Z4, Z2, and Z3 be the centers of triangles APQ, BQR, 
and CRP, respectively, we have 


3 3 3 
b+q+r lq-r)+r+q+r (1+0)g+(2-0č)r 
i ee 5 gr 
Ci Ca pms epee Oa ae 
ae gs a ae ae 
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We can prove that triangle Z;ZZz is equilateral by showing that (z3 — z1) = C(z2 — 21). (Note that we are careful 
again to use the same orientation of rotation as with the earlier equilateral triangles.) Substituting, we find that 


gym = EEOT+@- Op _ A+ Opt 2 Oq _ (1-20p + (-2 + qt + Or 


3 3 3 
and 
RE geo E E aa TEE a Or 
ga 3 3 7 3 i 
E L O A 2C Oe 
te-a) = E EEN 


Uh-oh; our expression for Ç(z2 — z1) doesn’t seem to match our expression for z3 — z1. However, we do have C? in 
the expression for C(z2 — zı), and back on page 283 we saw a convenient manipulation for primitive sixth roots of 
unity. Specifically, if Ç is a primitive sixth root of unity, then Ç? — Ç + 1 = 0, so Ç? = Ç — 1. Making this substitution 
in our expression for C(z2 — z1) gives 
PR Sk Se) a a C9 9 
3 3 5 


which does indeed match our expression for z3 — z1. Since z3 — z1 = Ç(z2 — z1), we know that Z1Z2Z3 is equilateral. 
g 


| Sidenote: The “Outer Napoleon Triangle” is equilateral; is the “inner” one also equilateral? 
| 


If you construct equilateral triangles AAPQ, ABQR, and ACFP so that they overlap 
APQR, is the triangle formed by connecting the centers of these three equilateral 
triangles also equilateral? 


| Problem 8.30: Monge’s Theorem: Three circles are drawn in a plane with Z2 
| different radii such that no circle is inside another. The common external tangents 
to each pair of circles are drawn, as shown. The intersections of pairs of common 
external tangents are labeled Z1, Z2, and Z3, as shown. Show that Z1, Z2, and Z3 Zi 
| are collinear. 


Solution for Problem 8.30: The only clue that we might be able to solve this problem with complex numbers is that 
we wish to prove three points collinear. We have a pretty straightforward condition for collinearity in the complex 
plane. Letting 21, Z2, and z3 be complex numbers corresponding to points Z1, Z2, and Z3, the points are collinear 
if and only if 

(21 — 22)(23 — 22) — (21 — 22)(23 — 22) = 0. 


Extra! There’s a wonderful proof of Monge’s Theorem that involves thinking of the problem as a cross- 

samp iim section of a three-dimensional configuration. As a hint, consider the centers of the three circles 
as the centers of three spheres whose radii equal the radii of the corresponding circles. 

Solution on page 297. 
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Now our goal is to find z1, z2, and z3 in terms of the radii and 
the complex numbers corresponding to the centers of the circles. We 
let P, Q, and R be the centers of the circles, and we'll refer to the 
circles as OP, OQ, and OR. Let OP, OQ, and OR have radii a, b, and 
c, respectively, with a > b > c. The two common external tangents to 
OP and OQ meet at Z3, the two common external tangents to OQ and 
OR meet at Z7, and the two common external tangents to OP and OR 
meet at Z1. 


tangents to ©P and OQ, so they are on the angle bisector of the angle 


formed by these tangents, which means P, Q, and Z; are collinear, as My 
shown. We draw radii PX and QY to the common tangent, thereby yy G o 
forming similar right triangles PXZ3 and QYZ3. ye i 


X Y Z3 


We start with Z3. P and Q are equidistant from the common external N 


Since P, Q, and Z3 are collinear, we know that (zs — p)/(q — p) is real 
and equals the ratio of magnitudes of z3 — p and q — p. That is, we have 


z3 -p _ PZ 

a TO 
We can use similar triangles PXZ3 and QYZ3 to express PZ3/PQ in terms of the radii of the circles. Specifically, 
we have QZ3/PZ3 = b/a, so PQ/PZ3 = (PZ3 — QZ3)/PZ3 = 1 — (b/a) = (a — b)/a, which means 


Zoe PZ% 8 
g-p PQ a-b 


Therefore, we have 


£ a ~ ~ _ap-bp aq-ap _aq-bp 
A ars a ee eo 


Similarly, we have 


br — cy 
Z? = 

b-c | 

ar—c 
Z| = P 

a—c 


We have to compute the expression 
(z1 — 22)(Z3 — 22) — (Z1 — 22)(Z3 — 22), 


since we wish to show that it equals 0. This looks pretty scary, until we realize that we haven’t chosen our origin 
yet. If we choose zz to be the origin, then this expression becomes a much more manageable 


2423 — 2423. 
Moreover, by letting z2 be the origin, the expression above for zz must be 0, so 


br — cq 
=e 


=0, 
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. . Ci . . . . 
which gives us r = 7, thereby allowing us to eliminate r from our expressions for zı and z3: 


acq 
ea CO) a C7 a ae k 
=. ee b(a — c) ee bp), 
Belli OF eee 
e aaa op); 


Concept: Don’t always specify the origin right away—sometimes waiting will allow you to | 
exploit symmetry in a problem or will reveal a surprisingly useful choice for the | 
origin. 


This particularly useful choice for the origin pretty much finishes off the problem. We now see that we don’t 
even have to compute 2123 — Zız3. We have 


z _ aa (9— bp) _ c(a—b) 


z (aq—bp) ba-y 


which is real because a, b, and c are real. Since z;/z3 is a real number, the origin, Z4, and Z3 are collinear. But Z2 
is the origin, so we have shown that Z1, Z2, and Z3 are collinear! (Moreover, we have also proved a relationship 
among Z1Z2/Z3Z2 and the radii of the circles.) O 


| Problem 8.31: Simson Line: Let Z be a point on the circumcircle of AABC besides 
| A, B, and C. Show that the feet of the perpendiculars from Z to the lines containing A! 


| the sides of AABC are collinear. SA 
| b 


Solution for Problem 8.31: Let the feet of the perpendiculars from Z to Te. vic. and AB be P, Q, and R, respectively. 
As usual, we let the lowercase letters of each of the point labels be the corresponding complex numbers. In order 
to show that P, Q, and R are collinear, we would like to show that 


(p-ap -7)-P-3Xp-r)=0. 


Our first step is to find p, q, and r in terms of a, b, c, and z. Fortunately, we only have to find one of them, since 
they'll all have the same form. Once we have one of them, we hopefully can use that form to determine the other 
two. 


Since P is on BC, complex numbers p, b, and c are collinear in the complex plane. This means that 
(p - b)(b - 2) - (P - bY(b - ¢) = 0. (8.8) 


We still need to find another equation for p (the one we just found only comes from the fact that P is on BC, which 
isn’t enough to uniquely specify P). We turn to our other restriction on P, which is that ZP 1 BC. This means that 


we have i 
(p —z)(b-—c) + (p - Z)(b - c) = 0. (8.9) 
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Comparing this to Equation (8.8), we see something promising. If we add the two equations, we cancel p, and 
we can solve for p. So, we add Equations (8.8) and (8.9) by combining (b — €) terms and (b — c) terms: 


((p —b) +p —2))(b - 0) + (-G — b) + @-D)(b-c) = 0. 


We therefore have (2p — b — z)(b — ©) + (b — z)(b — c) = 0, and solving for p gives 


1 b-z\(b- 
p=3 (EPEA baz), (8.10) 


(Note that we can divide by b — € because b + c.) Well, at least we found p, but that expression on the right is quite 
a mess. We're not stuck yet, though. We haven't chosen our origin. One natural choice might appear to be z, but 
then we're left with b and z, and we’d prefer to get rid of them, too. Otherwise, after writing similar expressions 
for q and r, we'll have @, b, and @ to deal with on top of a, b, and c. (Try it and see—you'll get a nice enough 
expression for p, but once you join it with the expressions for q and r, you'll have quite an intimidating mess on 
your hands for (p — 9)(p - 7) - P - 4)(p — 7)-) 

So, we go back to the drawing board on choosing the origin. We focus on the conjugates, since they scared us 
off the choice of z as the origin. A natural choice that presents itself for the origin is the center of the circle. By 
choosing the center of the circle as the origin and letting the radius of the circle be 1, we have Ja] = |b] = |e] = |z| = 1, 
so aa = bb = c = zz = 1. This allows us to get rid of all of the conjugates in our expression for p in Equation (8.10)! 
Let’s see what happens when we substitute to get rid of the conjugates: 


i_i = z-b — c) 
p-a (EEEE vs) <3 (EE see) =} (E ves) = (bere), 


1 cezb. 
2 n Z 


b c 


Now that’s something we can work with! By symmetry, we can now write expressions for q and r, and we have 


=5 (b+c+2-) 

P=3 zJ’ 
1 ca 

q=5 (ctatz-<), 
1 


ab 
r=5 (a+b+z-2). 


Now we're ready to tackle (p — q)(p — 7) — (p — 9)(p — r). If we can show this equals 0, then we know that P, Q, 
and R are collinear. Each of the differences in this expression gives us a lot of cancellation, so it’s not nearly as 
scary as it looks. We’ll start by simplifying (p — q)(p — T), since we can simply take the conjugate of the result to get 
F - Dp -1). We find 


F i (b-a- (b-a) (2-a-@-a) 
= T-a) (1- =) Eon (1-2) 


= at — a)(z — ec) —4)(z — b). 


i SE ye 
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We're very close now! We can again substitute for the conjugates (so zZ = 1, for example), and we have 


(p— @-7) = Fb- aye (4 - =) (4-2) = 1 _o-ae-oa-yo-2) 


a Z 


Taking the conjugate of this, and then substituting for @, b, ¢, and Z, gives 
— 4)(p — 1) = aa? — A)(Z e — ¢)(b — 2) 
da nes 
4 \b a/\z c/\a c/\b z 
abcd {a-b /c-z\ {c-a /{z-b 
a ab I CZ Mh ac ) ( =) 


(a — b)(c — z)(c — a)(z — b). 


_ 1 
— 4abcd 


This equals our expression for (p - q)(p — 7) because each of the four differences here is the opposite of the 
corresponding difference in our expression for (p — q)(p — 7). Therefore, (p — q)(pP — T) equals its own conjugate, 
which means it is real. This tells us that P, Q, and R are collinear. The line that contains these points is called a 
Simson Line of the triangle.o 


The key step in this solution can be a very powerful strategy: 


When applying complex numbers to a geometry problem involving a single circle, 
letting the circle be the unit circle centered at the origin can be a very powerful 
strategy. One reason this is so useful is that for any point z on the circle, we have 
z=1 

z 


8.6 Summary 


The image of a translation of z by the complex number w is z + w, the image of a dilation of z about the origin by 
scale factor c is cz, and the image of a rotation of z about a point w counterclockwise by an angle 8 is e® (z — w) + w. 


Important: The midpoint of the segment connecting z1 and z2 in the complex plane is 252, 


Let A, B, C, and D be four different points in the complex plane, and let a, b, c, 
and d be complex numbers corresponding to these points. 


e We have AB ll CD if and only if (b — a)(d — ©) — (b —a)(d — c) = 0. This is 
equivalent to the condition that (b — a)/(d — c) is real. 


e We have AB 1 CD if and only if (b - a)(d - 2) + (6 -@)(d — c) = 0. This is 
equivalent to the condition that (b — a)/(d — c) is imaginary. 


If complex numbers w and z correspond to points W and Z in the complex. 


plane, then WZ? = (w — z)(W — 2). 
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Problem Solving Strategies 
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| 


origin. 


Vo 


Things To Watch Out For! 


WARNING! 
yy 


For any complex numbers 21, Z2, 23,.-.,2n, we have | 
kaled 22 4 leal 2 e Zot ote, | 


where equality holds if and only if z1, 22, 23,...,Zn are on the same ray from the | 


i delle ie nt nt nant andra emt thin tenet 


the complex numbers a, b, and c. Triangle ABC is equilateral if and only if | 
a — b = Ç(c — b), where C is a primitive sixth root of unity. | 


et ne rin nein nn in tnt mini mrt di 


The orientations of rotations matter in complex numbers, just as they matter | 
in geometry. , i 


e When placing a geometric problem on the complex plane, we can choose | 
where to place the origin. Choosing a convenient point can often simplify | 
the problem greatly. 


Sometimes we can prove a statement by proving a more general statement. 


| 
Maintaining symmetry in systems of equations helps us find effective ways | 
to work with the equations. | 

| 


e A little wishful thinking can go a long way. | 


Some complex number problems that don’t appear to be about geometry 
can be solved quickly by thinking about the problem geometrically. 


e When your initial approach to a problem hits a seemingly unsurpassable 
obstacle, take a step back and look for another way to view the problem that 
allows you to avoid the obstacle. 


Complex numbers can often be helpful in problems involving rotations, 
since rotations are easy to express with complex numbers. 


Working backwards from the desired result is a powerful problem solving 
strategy. 


Don’t always specify the origin right away—sometimes waiting will allow | 
you to exploit symmetry in a problem or will reveal a surprisingly useful 
choice for the origin. 


When applying complex numbers to a geometry problem involving a single 
circle, consider letting the circle be the unit circle centered at the origin. 


nein 


REVIEW PROBLEMS 


REVIEW PROBLEMS p 


8.32 Describe the transformation that maps z to Z in the complex plane. 


8.33 For all z, let fı (z) be the image of z upon a translation of 2 units horizontally and —3 units vertically, and let 
f2{z) be the image of z upon a rotation about 4i counterclockwise by §. 


(a) Find (f2 0 fi)(z) in terms of z. 
(b) Find (fi © f2)(z) in terms of z. 
8.34 Let f(z) = iz. Describe the transformation that maps z to f(z) for all z. 


8.35 What must be true about the points corresponding to complex numbers a, b and c under each of the following 
conditions: 

(alg ola — cl — |b- cl 

(b) ja—bP +\a—cP = |b-cP 


8.36 For a given nonzero complex number z, describe in words how to find the following points in the complex 
plane: 


a = (b) Zz (c) z+ zl 
8.37 Use complex numbers to show that the line segment connecting the midpoints of two sides of a triangle is 
parallel to the third side and half the length of the third side. 


8.38 Let A correspond to 3 — 2i and B correspond to 1 — 51. 


(a) AB is perpendicular to the line through B and which of the following on the complex plane: 2 — 7i, —6 + 2i, 
= ae 

(b) Find an equation whose graph consists of point B as well as all complex numbers z such that the line 
through z and B is perpendicular to AB. 


8.39 Use complex numbers to show that if ZADB = 90°, then point D is on the circle with AB as diameter. 


8.40 Suppose that 2 + i and 8 — 3i are opposite vertices of a regular hexagon in the complex plane. What are the 
complex numbers that correspond to the other four vertices? 


8.41 The median of a trapezoid is the segment with the midpoints of the trapezoid’s legs as endpoints. Use 
complex numbers to prove that the median of a trapezoid is parallel to the bases of the trapezoid, and that the 
length of the median is the average of the lengths of the bases. 


8.42 Use complex numbers to prove that the diagonals of a parallelogram are perpendicular if and only if the 
parallelogram is a rhombus. 


8.43 Leta and b be distinct complex numbers. Show that z lies on the perpendicular bisector of a and b if and 
only if p 
(a — b)z + (a — b)Z = aa — bb. 


8.44 Suppose WXYZ is a square with center O. Let T be the midpoint of WX and S be the midpoint of OY. Use 
complex numbers to show that ATSZ is an isosceles right triangle. 
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8.45 Use complex numbers to show that if W, X, Y, and Z are four points in a plane such that WX is perpendicular 
to both XY and WZ, then the midpoint of YZ is the same distance from W and X. 


8.46 Show that a line segment joining the midpoints of opposite sides of a quadrilateral and the line segment 
joining the midpoints of the diagonals bisect each other. 


8.47 Suppose ABCD is a parallelogram and that complex numbers a, b, c, and d correspond to A, B, C, and D, 
respectively. Find d in terms of a, b, and c. 


8.48 Show that |w — z| > |w| — |z| for all complex numbers w and z. 


8.49 Let P be a point inside rectangle ABCD. Show that AP? + CP? = BP? + DP*. 


8.50 Let O be the center of square ABCD. Show that no matter what line k we draw through O, the sum of the 
squares of the distances from the vertices of ABCD to k is the same. Hints: 17 


8.51 Given a triangle ABC, suppose we construct squares BCDJ, CAFG, and ABHK on the sides of the triangle 
such that the squares do not overlap the triangle. We then locate points P and Q such that GCDQ and BHP] are 
parallelograms. Prove that APAQ is an isosceles right triangle. Hints: 269 


8.52 Let z,a, and b be complex numbers such that a + b. Let w be the reflection of z over the line joining a and b, 
and let p be the projection of z onto the line joining a and b. (That is, let p be the foot of the perpendicular from z 
to the line through a and b.) Find p and w in terms of a, b, and z. Hints: 150 


8.54 If z, and z? are complex numbers such that |z1 —5 +37] < 4 and |z2 —51| < 2, find the maximum and minimum 
values of |Z; — Z2|. Hints: 35 


8.55 Letz be a complex number. Find the minimum value of 
[el tt) 2 Ae I 
Hints: 151 


8.56 Let AABC be a triangle in the complex plane whose vertices correspond to complex numbers a, b, and c. 
Show that each of the following is a necessary and sufficient condition for AABC to be equilateral: 

(a) a+ r +C = 4 +ac + bc. Hints: 111, 233 

b) 44+ t=. 

(c) Por 50 


8.57x Fora positive integer n, define S, to be the minimum value of the sum 


y (2-1-1)? +a? + /(2-2-1% +45 + (2-3-1? +a +--+ (2-0-1) +42, 


where 41, 42, ..., An are positive real numbers whose sum is 17. There is a unique positive integer n for which S, 
is also an integer. Find this n. (Source: AIME) 


8.58 Suppose ABCDE is a pentagon such that the lines connecting A, B, C, and D respectively to the midpoints 
of EB, AC, BD, and CE are concurrent. Show that the line connecting E to the midpoint of AD also passes through 
this common point. Hints: 95, 264 
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8.59 Let PoP, --: P,»_1 be a regular n-gon inscribed in a unit circle C, and let P be a point on C. 


(a) Prove that PP} + PP? + --- + PP? = 2n for any P. Hints: 172 
(b) Prove that PP} + PP} +--+ PPt_ = 6n for any P. 


8.60% There are 2n complex numbers that satisfy both z? — z? — 1 = 0 and |z| = 1. These numbers have the form 
Zm = COS Ôm + isin Om, where 0° < 6; < 02 < -+ < O, < 360° and angles are measured in degrees. Find the value 
of 02 + 04 +--+- + Ozn. (Source: AIME) Hints: 22, 113 


8.61 Regular decagon P;P>2--- Pio is drawn in the coordinate plane, with P; at (1,0) and Pe at (3,0). If P, is the 
point (xn, Yn), compute the numerical value of 


(x1 + yrt)(X2 + y2i)(x3 + y3i) «++ (x10 + Yi0/). 
(Source: ARML) Hints: 217 


8.62x The sides of rectangle ABCD have lengths 10 and 11. An equilateral triangle is drawn so that no point of 
the triangle lies outside ABCD. Find the maximum possible area of such a triangle. (Source: AIME) Hints: 161 


8.63x Let P be a point inside equilateral triangle ABC with side s. Show that 
3(PA* + PB* + PC* + st) = (PA? + PB? + PC? +87)’. 
Hints: 96, 89 


8.64x Given triangle ABC, construct equilateral triangles BCX, CAY, ABZ externally. Let B’ and N’ be the 
midpoints of BC and CY, respectively, and let M and N be the midpoints of AZ and CX, respectively. Prove that 
B’N’ L MN. (Source: Argentina) Hints: 127, 85 


- Extra! Here’s the magical three-dimensional proof of Monge’s Theorem Z2 
w that is suggested on page 289. Suppose we start with three spheres 

with different radii such that no sphere is entirely within another. 

There are two planes that are tangent to all three spheres such that Za 

the three spheres are sandwiched between the two planes. Since 

the spheres do not have the same radii, these two planes intersect 

in a line which we'll call line k. For each pair of spheres, there is 

a cone that is tangent to both spheres. The aforementioned planes 

are also tangent to each cone, and therefore include the vertex of 

each cone. So, the vertex of each of the three cones determined by 

a pair of the spheres is on the line that is the intersection of the planes. 

With all that mind, picture the cross-section that is formed by the plane that bisects the angle 
between the planes. This plane must pass through the centers of the spheres, and must include 
line k (and therefore the vertex of each cone). Moreover, the intersection of the plane and each 
cone produces the two lines that are tangent to the corresponding pair of circles. Therefore, 
this cross-section produces exactly the situation in Monge’s Theorem, and shows why the three 
points in question are collinear. 


Z3 
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A plan is just a tangent vector on the manifold of reality. - Skratch Garrison 


CHAPTER 


—— 


Vectors in Two Dimensions 


9.1 What's a Vector? 


: : eee ; : X 
A vector in two dimensions is an ordered pair of numbers, which we often denote as (*) . The numbers x and y 


are called components of the vector. Vectors have the following operations: 
e Vector addition. We add vectors by adding the corresponding components: 
GH) *(@) = (oa) 
b d) NO) 
e Multiplication of a vector by a constant. We multiply a vector by a constant by multiplying each component 


b B kb à 
a 


We often refer to real numbers as scalars when speaking of vectors. So, the product k a is commonly referred 


to as scalar multiplication. In Problem 9.4, we'll see why real numbers earn this name when speaking of vectors. 


Instead of referring to a vector by its components, we can give a vector a label. When we do so, we usually 
use a bold lowercase letter that is not italicized, like v. Other notations that are sometimes used to denote a vector 
=> A 
are Jor ô. 
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eoc a OO A VEIN 


A vector can be represented geometrically as an arrow. The arrow from (0,0) to y 
(1,3) in the diagram at right represents the vector ‘all The diagram also shows an 


arrow from (2, —4) to (3,-1). These arrows represent the same vector. The location of a 
vector is irrelevant. So, any arrow that depicts “right 1 unit, up 3 units” represents the 


vector (3) : 


We will occasionally refer to vectors with points, writing BC for the vector that is 
represented by the arrow from B to C. If we write a vector with one point, such as A, 
we mean the vector from the e origin to that point. (Note that the notation BC is slightly 
different than the notation BC. BC denotes the ray t that starts at B, passes through C, 


and keeps going forever in that direction. The vector BC is represented by an arrow that 
goes from B to C and stops there.) 


We won't always be this precise in our language when referring to vectors in a geometric context. We will 
often simply refer to “an arrow that represents a vector” as simply “a vector,” rather than littering the text with 


s n p 1 AÅ A Š 
an arrow that represents.” So, letting v = 4 , we see that both A and BC in the diagram above are v, and we 
can write A = BC =v. 
A vector with magnitude 1 is called a unit vector. The vector o can also be written as xi + yj, where i is the 


unit vector (5) and j is the unit vector Gl For example, the vectors A and BC shown at right above are both 


i+ 3j. We will mostly use the notation (3) , but will occasionally mix in i + 3j notation so you'll be comfortable 


3 
with it. 


The zero vector is the vector in which both components are zero. We often denote this vector as simply 0. 
We also have a special symbol for the set of all vectors in two dimensions: R°. To see why we use this symbol, 
recall that R is the set of all real numbers. So, we use the symbol R? to indicate the set of all ordered pairs of real 
numbers. 


a > 


Problem 9.1: We define the norm of a vector (*) to be the distance from (x, y) to the origin in the Cartesian 


G 


plane. We denote the norm of a vector v as |[v||. Find in terms of x and y. 


Problem 9.2: 


(a) Ifv = 3i- 4j, then what is ivi al 


(b) Ifv= A then what is a 


ORE o) then what is 


v : 
(d) Generalize your results from the first three parts. That is, what can we say about “al when v is a 


nonzero vector? 
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Problem 9.3: Let xi + yj be a nonzero vector, and let (r, 0), where r > 0, be the polar coordinates of the point 
with rectangular coordinates (x, y). How are r and @ related to the vector xi + yj? 


Problem 9.4: Let v be a nonzero vector. 
(a) Geometrically speaking, how are v and 3v related? 


(b) Let -v = (—1)v for any v. Geometrically speaking, how are v and -v related? 


Problem 9.5: In this problem, we explore vector addition. Let v = a and w = an 


(a) Find x and y such that v + w = Gk 


(b) Let O be the origin, let A be the point (5, 1), let B be the point (2, 4), and let C be the point (x, y), where x 


and y are your answers from part (a). What are OA, OB, OC, and AC in terms of v and w? What type of 
quadrilateral is OACB? 


Use your answers to part (b) to give a geometric description of adding two vectors. 


(c) 


Problem 9.6: In this problem, we explore vector subtraction. Let v = (7) and w a e 
(a) We define vector subtraction to fit with our definition of vector addition. The vector w — v is the vector 
u such that u + v = w. Use the definition of vector addition to find x and y such that w — v = a : 
(b) Let O be the origin, let A be the point (5, 1), let B be the point C 4), and let C be the point (x, y), where x 
and y are your answers from part (a). What are OA, OB, OC, and AB in terms of v and w? 
(c) Show that for any points P and Q in the Cartesian plane, we have PQ= = Oe Ta 


How are PÒ and QP related? 


Problem 9.1: We define the norm of a vector (7) to be the distance from (x, y) to the origin in the Cartesian 


3 Nile 
plane. We denote the norm of a vector v as ||v||. Find | ( ) in terms of x and y. 


Solution for Problem 9.1: Applying the distance formula gives us 


C) = x-0} + (y - 0} = y2 + y. 


Important: The norm of a vector v = (G) is 


livi] = yx + 42. 


The norm of a vector is also called the magnitude or length of the vector, since 
it measures the length of the geometric representation of the vector. 
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Problem 9.2: 
(a) Ifv = 3i- 4j, then what is tale 
vV 


| (b) Ifv= (,), then what is al 
z Iivi] 


0.5 v 
c) Ifv= ( | th is || —||? 
(c) Ifv _¢.2) then what is ml 


: é 3 Vv 3 
Generalize your results from the first three parts. That is, what can we say about Fal when v is a 
v 


nonzero vector? 


Solution for Problem 9.2: 


(a) We have |lv|| = 41/32 + (-4)2 = 5, so i = v = i — Sj. Therefore, we have 
v 


"ROEOR 


ĝi TR v [1m2 
(b) We have |Iv|| = 4/72 + (-7)2 = 7 V2, so Ri (102) and 


S ee 
MI V2 20° 
Very suspicious. 
(c) Wehave|lv|| = ,/(0.5)? + (-0.2)? = v0.29, so 7 = ce a): Rather than rationalizing the denomina- 


tor or getting rid of the decimals, we forge ahead finding | T7 


op ) = [0.25 + 0.04 _ 
v0.29) 0.29 7 


(d) We have a pretty good idea what the result will be. We let v = G) and work through the details. We have 


, because we're pretty confident something 


convenient will happen: 


livll 


Very convenient, indeed. 


IIvll = 4/x2 + y7, which is nonzero because v # 0, so 
Gay Gy 
x2 + y2 x? +y? ap ile 


As noted in the introduction, a vector with magnitude 1 is called a unit vector. In Problem 9.2, we showed that 


Vv 
lIvll 


O 


the result of dividing a nonzero vector by its magnitude is a unit vector. Since ivi is a unit vector, when we write 
v 
ja vl 
v= [vl 
Iivi’ 
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we express v as the product of its magnitude and a unit vector. For example, we have ||3i — 4j | = ,/32 + (-4)2 =5, 
so we can express 3i — 4j as the product of its magnitude and direction vector as follows: 


ee 
3i-4j =5 (Ži =i) 


Geometrically speaking, we can think of this unit vector as providing the direction of the vector, and the 
magnitude as telling us how far the vector goes in that direction. Since each nonzero vector can be written as the 
product of its magnitude and a unit vector, we can think of a nonzero vector as being defined by its magnitude 
and its direction (as given by the unit vector). 


We can also define the direction of a vector by using an angle, as we'll see in the next problem. 


Problem 9.3: Let xi + yj be a nonzero vector, and let (r, 0), where r > 0, be the polar coordinates of the point 


with rectangular coordinates (x, y). How are r and 6 related to the vector xi + yj? 


Solution for Problem 9.3: Since x = rcos 0 and y = rsin 0, we have 


xi + yj = (r cos 0)i + (r sin 8)j = r((cos @)i + (sin @)j). 


=/et+y = |lxit yjl, 


We have 


so r is the magnitude of xi + yj. Since 


||(cos 6)i + (sin 6)j|] = J (cos 6)? + (sin 0} = 1, 


we see that (cos 0)i + (sin @)j is a unit vector, so 0 can be used to describe the direction of the vector. O 


Problem 9.4: Letv bea nonzero vector. 


(a) Geometrically speaking, how are v and 3v related? 


(b) Let —v = (—1)v for any v. Geometrically speaking, how are v and —v related? 


Solution for Problem 9.4: 


(a) Ifweletv = @ , we have 3v = Gah The origin, the point (a, b), and the point 


(3a, 3b) are collinear, since all lie on the line through the origin with slope l (when 
ais nonzero; ifa = 0, then all three points are on the y-axis). Therefore, it appears 
that v and 3v have the same direction. To be more precise, we can express each 


as the product of its magnitude and direction. We have ||v|| = Va? + b2 and 


|3v|| = 4/ (3a)? + (Bb)? = 3 Va? + b2, so 


b 
= Vie +B ( z it rai), 
ae Va + o? 
3b 
ov = 3 Vare (— j) 
Ta Toe 
b 
-3 väri (—* i+ i) 
eae e 


We see that the vector 3v has same direction as v, but has three times the 
magnitude. That is, 3v is a ”scaled” version of v. 


302 


9.1. WHAT’S A VECTOR? 


(b) Ifv= @ , then -v = (=) . These two vectors are indicated in the diagram at 


right. The point (a, b) and the point (—a, —b) are on the line through the origin 
with slope b (when a # 0; if a = 0, then all three points on the y-axis with the 
origin in the middle). However, the origin is between (a, b) and (—a, —b), so the 
vector from the origin to (a,b) is in the opposite direction as the vector from 
the origin to (—a,—b). Since the origin is equidistant from (a,b) and (—a,—b), 
we see that v and —v have the same magnitude, but opposite direction. 


Problem 9.5: In this problem, we explore vector addition. Let v = (7) and w = ( 


(a) Find xand y such that v + w = e) ! 


(b) Let O be the origin, let A be the point (5, 1), let B be the point (2, 4), and let C be the point (x, y), where x 


and y are your answers from part (a). What are OA, OB, OC, and AC in terms of v and w? What type of 
quadrilateral is OACB? 


Use your answers to part (b) to give a geometric description for adding two vectors. 


(c) 


Solution for Problem 9.5: 


i ne 5 2 5+2 7 
(a) Following the rule for vector addition, we have ia + ( 4) = f oe a = @ 


(b) Since the arrow from O to A indicates going to the right 5 and Y 
up 1, we have OA = @ = v. Similarly, we have OB = A =W, 


and OC = (2) =v+w. To get from A to C, we go right 2 and 


up 4, so AC = lak This is w! We therefore have AC = w = OB, 


4 
which means sides AC and OB of OACB are parallel (AC and OB 


have the same direction) and have the same length (AC and OB 
have the same magnitude). So, OACB is a parallelogram. 


(c) In AOAC, we see that OA = v, AC = w, and OC = v + w. This isn’t a coincidence. We can follow exactly 
the same steps to add any two vectors. We start at the origin and draw the arrow corresponding to v. 
Then, starting from where this arrow ends, we draw the arrow corresponding to w. The arrow from the 
origin to the end of this second arrow corresponds to v + w. We can also view v + was the diagonal of the 
parallelogram with v and w as consecutive sides. 


We can also see in our diagram that vector addition is commutative. That is, we have 
V+W=WHy, 


where v + w corresponds to the path O — A — Cand w + v corresponds to the path O > B > C. 


o 


The vector addition illustrated in the previous problem can be extended to any number of vectors. The process 
of adding vectors described in part (c) is sometimes referred to as head-to-tail. addition, where the start of an 
arrow depicting a vector is the tail of the vector and the end of the arrow is the vector’s head. 


en 
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Pa 
d 

d 

We add a series of vectors by drawing the first, and then drawing the second by placing the tail of the second 


at the head of the first. We continue in this manner until all the vectors are drawn. The arrow from the tail of the 
first vector to the head of the last represents the sum of all the vectors. 


Now that we have a handle on addition, we turn to vector subtraction. 


Problem 9.6: Let v = 6) and w = PN 


| (a) We define vector subtraction to fit with our definition of vector addition. The vector w — v is the vector 
| u such that u + v = w. Use the definition of vector addition to find x and y such that w — v = o : 


(b) Let O be the origin, let A be the point (5, 1), let B be the point (2,4), and let C be the point (x, y), where x 
and y are your answers from part (a). What are OA, OB, OC, and AB in terms of v and w? 


(c) Show that for any points P and Q in the Cartesian plane, we have PÒ = G =p 
How are PQ and OP related? 


Solution for Problem 9.6: 
(a) Ifw-v= (*) , then we must have w = G) + v. Applying the definition of vector addition, we have 
DG +) =a) 
4) \y Var vy rly” 


Therefore, we must have x +5 = 2 and y +1 = 4, so x =2—5 = —3and y = 4-1 = 3. In other words, vector 
subtraction works in exactly the way we would expect: 


Aaa = G3) = GG). 


In Geg we can follow onen same process to show that: 


Important: — For any vectors e and ( ab we have 


hia o A =A 


TE I a Ne a I 


This Te means mine we can view vector subtraction as a combination of vector addition and scalar 


va on (1) (a) = (6) + (@) = (6) + (a) = GD 
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SS Sn SSS SSS SSS SS SSE tS 


(b) As in the previous problem, we have OA = v and OB = w. Since 


(x, y) = (-3, 3) from part (a), we have o = E) = w-v. Since point 


B is 3 to the left and 3 above A, we have AB = Gy) , SO AB=w-v. 


(c) In the diagram, let O be the origin. From our definition of vector subtraction, the Q 
vector G — P is the vector we must add to P to get O Consider the vector sum P + PÒ. 
Since P is the vector from the origin to P, and the vector PÒ is the vector from P to Q, 
when we perform head-to-tail addition to get P+ PO, we go from the origin to P, and 
then from P to Q. So, the vector sum P+ PQ equals 0. Therefore, PO: is indeed the 
vector we must add to P to get C. which means PÒ = Ge P. O 


Cu 


P 


(d) Using the relationship from part (c), we have PÒ = G -P and QP =P- O, so PQ = —OP. We could also 
have noted that the vector from P to Q (the vector PQ) has the same length but opposite direction as the 
vector from Q to P (QP), so we have PÒ = —OP. 


| Important: _ For any points A and B, we have AB = B - A. 
| 


| Exercises 


9.1.1 Use the definitions of vector addition and multiplication of a vector by a scalar to show that 0 + v = v and 
Ov = 0 for all v. 


9.1.2 Evaluate the following: 


(3) 
5 
9.1.3 Express each of the following as a single vector: 
4 5 =í =l —6 0 
waa ona E 


9.1.4 IfA, B,C, and D are points in the Cartesian plane such that AB = CD and the four points are not all on the 
same line, then must ABDC be a parallelogram? 


@ | © 7-24 


9.1.5 Show that ||cv|| = |c||Ivl| for all vectors v and scalars c. 


9.2 The Dot Product 


In the previous section, we learned how to add and subtract vectors and how to multiply a vector by a scalar. In 
this section, we define a special product of two vectors, the dot product. We denote the dot product of vectors v 
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and w as v- w. Letting v = a and w = @ , We define the dot product v- w as 
2 2 


vı W1 
a = VW + VW. 
02 wW : 


The dot product is also sometimes referred to as the inner product. 


4 5 =% 
(3) a= (a)r (7): 
(a) Evaluate p -qand q: p. 

(b) Evaluate p : (2r) and 2(p - r). 
(c) Evaluate p-(q+r) andp-q+pr-r. 


Problem 9.7: Let p = 


(d) Generalize your results from parts (a), (b), and (c). That is, show that for all vectors u, v, w and all scalars 
a, we have u -v = v: u, u: (av) =a(u-v)andu:(v+w) =u-vtu-w. 


Problem 9.8: Show that for any vector v, we have v: v = ivi. l 


Problem 9.9: In this problem, we find a geometric interpretation of the dot product of two nonzero vectors. 


(a) Let V and W be points in the Cartesian plane such that V = v and W = w, and let O be the origin. Let 
0 = LWOV. Use AWOV to express (v — w) - (v — w) in terms of |[v||, Iwll, and 8. 


(b) Expand the product (v — w) - (v — w) using your results from Problem 9.7. 


(c) Combine your results in parts (a) and (b) to show that v - w = |[v|| ||w|| cos 8. 


Problem 9.10: Find the angle between each pair of vectors below. 


TEO 
o (2%) and (3) 
© (j) ana (25%) 


Problem 9.11: We say that two vectors are orthogonal if their dot product is zero. Suppose we draw arrows 
representing two nonzero vectors from the same point. If these vectors are orthogonal, then how are the arrows 
that represent them related? 


Problem 9.12: If p- q = p-r for a nonzero vector p, then must we have q = r? 


Problem 9.13: Use the dot product to prove that 


(ayi + xy) < (xt + ANY + y3) 


for all real x1, x2, y1, and y2. When does equality hold? 
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Problem 9.14: Suppose f is a function whose domain is R?, which means that its domain is all vectors in two 
dimensions. We say that f is a linear function if 


(i) f(av) = af(v) for all scalars a and vectors v, and 
(ii) f(v +w) = f(v) + f(w) for all vectors v and w. 
Suppose 9(w) is a linear function that maps vectors to scalars. That is, it takes a vector as an input and gives a 
2 3 r 
scalar as an output. For example, g(w) = (7) -w is one such function. 
(a) Explain why g(xi + yj) = xg(i) + yg(j). 


(b) Why does part (a) tell us that there exists some vector v such that g(w) = v- w for all w? Is this vector v 
unique? 


Problem 9.7: Let p = Gy q= (3), and r = € 
(a) Evaluate p: qand qp. 
(b) Evaluate p - (2r) and 2(p-r). 


(c) Evaluate p-(q+r)andp-q+p-r. 


Generalize your results from parts (a), (b), and (c). That is, show that for all vectors u, v, w and all 
scalars a, we have u : v = v - u, u- (av) = a(u-v)andu:(v+w) =u-v+u-w. 


Solution for Problem 9.7: 
(a) We have p- q = (4)(5) + (—3)(2) = 14 and q` p = (5)(4) + (2)(-3) = 14, so p-q = q - P- 
(b) We have 
p-(2)= (5) (Fp) = O9 + E304 = -58 


and 


2px) =2((4)-(P)) = 24-2) + 37) = -58. 


Interesting; we have p- (2r) = 2(p-r). 


(c) We have 
rar (OG Ooa- 


p-qtp-r= Gaa +5) (7) =14-29 = -15. 


We're not surprised to see that p-(q+r)=p-qt+p-r. 


(d) Letu= ey v= oy and w = or We then have 


and 


UV = UU + «202, 


VU = Viu + VU = U,V, + U22, 
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so the dot product is commutative: we have u- v = v-u for all vectors u and v. We also have 


u: (av) = (2). (892) = (ulao) + (u)(a02) = auon + uzo) 


uz av2 


a(u-v)=a ce . ey = a(uyv, + U2U2). 


Therefore, we have u- (av) = a(u - v) for all scalars a and vectors u and v. 


and 


Turning to u- (v + w) and u-v + u- w, we have 


uy vı + Wy, 
w) = = u1(v, + W1) + Uo(vo + w 
u-(v+w) A E 1(vı + W1) + u2(V2 + w2) 


and 
uy C1 uy wi 
u-v+u:w= . + . = U,V, + U2U2 + Uy Wy + UQW2 = U1 (V1 + W1) + U2(V2 + W2). 


U2 02 u2 W2 


As expected, we have u : (v + w) =u-vt+u-w. 


Important: 


For any scalar a and vectors u, v, and w, we have 


u-v=V-u, 
u- (av) = a(u- v), 
u-(v+Ww)=u-vt+u-w. 


As we noted on page 307, a function f with domain R? is a linear function if f(av) = a f(v) for all scalars a 
and vectors v, and f(v + w) = f(v) + f(w) for all vectors v and w. Together, the properties u - (av) = a(u- v) and 
u:(v+w) =u-v+u-wtell us that for any vector u, the function f(v) = u- v is a linear function. Combining this 
with the fact that u- v = v- u, we see that f(v) = v- u is also a linear function. We care about this because so many 
relationships in a great many fields of study are linear relationships. After we get a little more experience with 
the dot product, we'll prove that every linear function that maps vectors to scalars can be represented with a dot product. 
So, by studying the properties of the dot product, we are studying the properties of linear functions that arise in 
a wide range of fields. 


Sidenote: We say that the function g(u, v) = u- v is bilinear because it is linear in both "| 
N and v. 


Solution for Problem 9.8: Let v = a , SO that 


ep ey N Ue 2 
v =) (9) =o +03 = IIe. 
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Important: For any vector v, we have v: v = Ilvil?. 


We can use the basic principles we proved in Problems 9.7 and 9.8 to develop a geometric interpretation of the 
dot product. 


| Problem 9.9: Let 0 be the angle between nonzero vectors v and w. Show that v- w = I|v|| |lw|| cos 8. j 


Solution for Problem 9.9: The presence of ||v||, ||w|| and cos 0 suggest a geometric approach. 
At right, we have points V and W, where V = v and W = w, and we let the origin be O. 
We therefore have OV = |[v||, OW = |lw||, and WV = ||WV]| = Iiv - will. 


We try the Law of Cosines, since this will produce a term of the form ||v||||w|| cos 8. 
The Law of Cosines gives us WV? = OV? + OW? — 2(0V)(OW) cos 0, or 


2 2 2 
llv — wll? = Iv + Ilwil — 2 [vil [wll cos 8. 


To introduce dot products, we apply the relationship we proved in Problem 9.8, which gives us Ivl? = v- v, 
Iwl? = w - w, and ||v — wl? = (v — w) - (v — w), so we can write the Law of Cosines equation above as 


(v-w): (v-w)=v:v+w-w-2livli iwl] cos 8. 


This equation doesn’t have v- w, yet. Applying the fact that the dot product is distributive, which we proved in 
Problem 9.7, we have 


(v-w) (v-w)=(v-w) v-(v-w)- w 
=V:V-W'V-V':W+W: W. 
Since v - w = w- v, our Law of Cosines equation now is 
v:ev-2v:-w+w:w=>=v:v+w:w-—2l|ivlillwl]cos 8. 


Simplifying this equation gives 
v- w =||lv|| ||lwl| cos 0. 


0 


Important: If Ois the angle between nonzero vectors v and w, then 


v -w = ||v|| Ilw] cos 8. 


Rearranging this equation gives 
vw 
cos 0 Sy. 
IIv il iwl 


We can use this relationship to find the angle between two vectors. 


Problem 9.10: Find the angle between each pair of vectors below. 
-1 0 12 2 A me 
| (a) (a) and e (b) (73) and ia (c) 1 and ie £213 


Solution for Problem 9.10: In each of the following parts, let 0 be the angle between the two given vectors. 
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(a) We have 
_ COANE | -2 VB _ NB 
COS 9 = — == R a 
V612 + (V3) - V + 2 
so 0 = 150°. 
(b) We have 
a a = 
122 + (—8)2 - V22 + 32 
so 0 = 90°. 
(c) We have 
cos 9 = —C2-VD+WC1+2V3) J 
V2 +4 12 -/(-2- V3)? + (1423) seo 2 
so 0 = 120°. 
o 


Problem 9.11: We say that two vectors are orthogonal if their dot product is zero. Suppose we draw arrows 
representing two nonzero vectors from the same point. If these vectors are orthogonal, then how are the arrows 


that represent them related geometrically? 


Solution for Problem 9.11: Let the vectors be v and w. Since v and w are nonzero, we have |{v||||w||cos 0 = 0 if 
and only if cos 0 = 0, which means the angle between the vectors is 90°. Therefore, the arrows representing the 
vectors are perpendicular if and only if the vectors are orthogonal. 


So, we can interpret “orthogonal” geometrically to mean “perpendicular.” Since these concepts are so closely 
related, they share a symbol. We can write that v and w are orthogonal as v L w. O 


If the dot product of two vectors is 0, then then the vectors are said to be | 
orthogonal to each other. Geometrically speaking, two nonzero orthogonal | 


vectors are perpendicular. | 


The vector 0 is orthogonal to all vectors. 


[Problem 9.12: If p -q = p: r for a nonzero vector p, then must we have q = r? j 


Solution for Problem 9.12: No. Rearranging the equation p- q = p-r gives p - q — p-r = 0, which we can “factor” 
as p - (q - r) = 0, because the dot product is distributive. Therefore, if p - q = p-r, all we know is that the vector 
q — ris orthogonal to p. We cannot conclude that q = r. 


Important: 


Now that we know what to look for, we can quickly find an example for which p - q = p-r, but q # r. Let 
p= E Then, we find x and y such that ey ae = 0. One such pair is (x, y) = (1,2). Therefore, if q—r = ie) 


then we have p- q = p-r. We can choose any vector for r and solve for the corresponding q. Specifically, suppose 


r= (5), then we have q = (3) +r= (7) and indeed we have p-q= p-r = 4, but q + r. g 


WARNING!! We cannot “divide” vectors from dot products. For example, if u-v = u- w, | 
ry 
_. # then we cannot deduce that v and w are the same. -_ 
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Problem 9.13: Use the dot product to prove that 


(xyi + X2Y2)* < (xf + Fyi + y3) 


for all real x4, x2, y1, and y2. When does equality hold? 


Solution for Problem 9.13: The expression on the lesser side looks like the square of a dot product and the expression 
on the greater side looks like the product of the squares of vector norms. So, we let x = A and y = k ) . Then, 
2 2 


we have 
X: y =X%Yy1 + X2Y2 


and 


ixi? ly? = 0 tA +). 


We also have x: y = ||xl| Iyl] cos 0, where @ is the angle between x and y. We must have |cos 8| < 1, so 


|cos 9| <1. 


x yon 
ixi y] 


* and substituting our expressions for 


Squaring the left and right ends of this, multiplying both sides by ||x|/? lly 
x- y and |[x||? ly? from above gives the desired 


(xiya + x2y2)? < (XF + X3) +H). 
(Note that we can perform the initial squaring because both sides of the inequality are nonnegative.) 


We have equality if and only if | cos 0| = 1, which means 0 = 0° or 0 = 180°. This occurs if and only if x and y 
are in the same direction, or in opposite directions. In either case, we must have x = cy for some nonzero constant 
c. Conversely, if x = cy for some nonzero constant c, then @ = 0° or 0 = 180°, in which case we have | cos 6| = 1, so 
equality holds. Finally, equality holds if either vector is the vector 0. Therefore, equality holds in the inequality if 
and only if y is 0 or if x = cy for some constant c. (Letting c = 0 covers the case in which x is the zero vector.) 


This result is called the Cauchy-Schwarz Inequality. 0 


We finish this section by returning to our claim back on page 308 that every linear function that maps vectors 
to scalars can be represented with a dot product. 


| Problem 9.14: We say that a function f with domain R? is a linear function if — 


(i) f(av) = af(v) for all scalars a and vectors v, and 


(ii) f(v + w) = f(v) + f(w) for all vectors v and w. 


Suppose g(w) is a linear function that maps vectors to scalars. That is, it takes a vector as an input and gives a 


2 : : 
scalar as an output. For example, g(w) = (7) -w is one such function. 


(a) Explain why g(xi + yj) = xg(i) + y8(j). 
(b) Why does part (a) tell us that there exists some vector v such that g(w) = v- w for all w? Is this vector v 
unique? 
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Solution for Problem 9.14: 
(a) Applying rule (ii) in our definition of a linear function, we have 
gli + yj) = gxi) + g(yj). 
Applying rule (i) to each of the terms on the right gives 
(xi + yj) = glxi) + g(yj) = xG) + ysl). 
(b) From part (a), we know that if w = wii + woj, we have 
g(w) = g(wii + wj) = w - Bi) + w2: gli). 


g(i) 


: ), then we have 
g(j) 


So, if we let v = ( 
g(w) = w - gG) + w2: gf) = ca . C =v: W. 
sj) w2 
Since g(i) and g(j) are unique, this choice of v is unique. Therefore, we have shown that: 


p! 


mportant: Each linear function g that maps vectors to scalars can be defined as g(w) =v-w 


Vv for a unique vector v. 


This is why dot products are so important, and why you'll be seeing them throughout your future studies 
of science and mathematics. 


9.2.1 Evaluate each of the following 
5 =2 —6 0 
@ (%)-() o (D Gs) 
9.2.2 Let u = 5i- 12j and v = 3i + 4j. Find (u — v) - (u + v). 


9.2.3 Find all vectors that have 5 as one component and are orthogonal to iy) ; 


9.2.4 Which two pairs of the following 5 vectors are orthogonal: ie f E : (5) ; (5) : ie 


9.2.5 Find, to the nearest degree, the measures of the angles of AABC if A is (0,5), B is (1, —3), and C is (—5, 4). 
9.2.6 In this problem, we find another proof that the algebraic and geometric representations of the dot product 
are equivalent. Let v = @ and w = ie , and let ġ be the angle between v and w. 

2 2 


(a) Let (ry, Oy) and (rw, Ow) be the polar coordinates of the points whose rectangular coordinates are (v1, v2) and 
(w1, w2), respectively. Express ||v|| ||wl| cos @ in terms of these polar coordinates. 


(b) Show that your expression from part (a) equals vw, + V2Wo. 
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9.3 Lines and Linear Dependence 


In your study of algebra, you probably learned that if a, b, and c are constants, then the graph of the equation of 
ax + by = c is a line. In this section, we revisit the algebra of lines in the Cartesian plane in the context of vectors. 


Geometrically speaking, any two distinct points A and B determine a line AB. We can think of this line as 
consisting of A together with all the points we can reach by going either in the direction of B or in the opposite 
direction (away from B). This leads us naturally to a description of a line using vectors. 


As a reminder, parametric equations for x and y are equations that define the relationship between x and y 
through an intermediary variable. For example, consider the parametric equations x = 3t, y = 2t. We can write 
these parametric equations as a single parametric equation in terms of vectors: 


x 3 
G) z (>) 
When we graph such a parametric equation, we graph all points (x, y) such that there is a value of t for which 
(x, y) is a solution to the equation. 


Problems 


Problem 9.15: Let A be (2,3) and let B be (—3, 1). 
(a) Explain why AB is the graph of the parametric equation 


)= (3) +#(3). 


(b) Is this the only parametric equation of the form 


(*) =u+tv 
y 


whose graph is AB? 


Problem 9.16: Let u and v be vectors with v # 0. Explain why the graph of the parametric equation 


(*) =u+iv 
y 


fits our geometric description of a line. 


Problem 9.17: 
(a) Show that if the graph of J = u + tv contains (p,q), then the graph is the same as the graph of 


Q-Q 


(b) Let vı and v2 be nonzero vectors. Show that if the graphs of a = u + sv and (5) = u + tvz both 


contain distinct points (Xa, Ya) and (Xp, yp), then the graphs are the same and vı = kv2 for some constant k. 
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Problem 9.18: 
(a) Express the equation 2x ~ 3y = 6 as a parametric equation of the form (3) =u-+tv. 


(b) Show that the graph any equation of the form ax + by = c, where at least one of a and b is nonzero, is also 
the graph of a parametric equation of the form 


xX : r 
Show that the graph of any parametric equation of the form ( ) = u+ tv, where v is nonzero, is the same 


as the graph of some equation of the form ax + by = c, where 4, b, and c are constants. 


How is the vector v related to the slope of the line i = u + tv? 


Problem 9.19: The vector av + bw, where a and b are scalars, is called a linear combination of v and w. In this 
problem, we explore the graphs of all linear combinations of pairs of vectors. Specifically, we are interested in 
the graph of the parametric equation 
x 
( ) = sv + tw, 
y 


(a) Whatis the graph if v = (7) and w = (3) ? 


where both s and ¢ are parameters. 


(b) What is the graph if v = (5) and w = ( 


(c) What is the graph if v = i) and w = ( 


Problem 9.20: Let v and w be nonzero vectors. Show that the graph of the parametric equation (3) =sv+tw, 


where both s and t are parameters, is a line if v is a scalar multiple of w. 


Problem 9.21: Let v and w be nonzero vectors such that v is not a scalar multiple of w. In this problem, we 
show that every point (x, y) is in the graph of the parametric equation 


( ) 
S tw, 
y 


(a) What is the graph of 0) =v+tw? (Note that the coefficient of v is 1, not s.) 


where s and t are parameters. 


(b) What is the graph of G) = 2v + tw? 
(c) How are your graphs in parts (a) and (b) related? 


(d) Show that every point (x, y) is in the graph of the parametric equation @ = sv + tw, where s and t are 


parameters. 
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Problem 9.15: Let A be (2,3) and let B be (-3, 1). 


(a) Explain why AB is the graph of the parametric equation G) = (3) +t E 


(b) Is this the only parametric equation of the form C) = u + tv whose graph is this line? 


Solution for Problem 9.15: 


(a) Inthe introduction of this section, we described the line through A and B as 
being all the points we can reach by starting at point A and going either in 
the direction of B or in the opposite direction (away from B). The direction 


from A to B is the direction of AB. We have AB = =) - (3) = E To 


get to any point on AB, we start from (2,3) and add some multiple of AB. 


So, AB is the graph of 
( ) (3) : a 
y l 


(b) The equation we found in part (a) isn’t the only parametric equation whose graph is AB. We can choose 
any point on the line as our starting point, and any other point on the line to travel towards (or away from) 
to produce an equation for the line. For example, we could start from B and go towards or away from A. 


Then, we have have BA = —AB = ia , and the line is the graph of 


= (a) #6): 


; 5 n } ; 
Moreover, we can use any nonzero multiple of E as the direction vector. For example, if we let s = 4k in 


the equation above, we have the parametric equation 
x =) 20 
(3) = (2) +). 
which also has AB as its graph. So, the choice of u and v such that AB is the graph of (*) = u + £v is not 


5 
unique. Vector u can correspond to any point on AB, and v can be any nonzero multiple of (z) ; 


Problem 9.16: Let u and v be vectors with v # 0. Explain why the graph of the parametric equation 


A = u + tv fits our geometric description of a line given on page 313. 


Solution for Problem 9.16: Letu = a Then, the equation “e =utiv= e + tv consists of (u1, u2) together 
2 


with all points we can reach by starting from (u1, u2) and going in the direction of v, or going in the direction 
opposite v. This is precisely the description of a line given on page 313. O 


The previous two problems are not a proof that the graph of the parametric equation P = u + fv is a line. 
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They explain that if we define a line in the Cartesian plane to be the set of points that satisfy this equation for some 
specific u and v, then this line fits our geometric description of a line. 


Definition: If u and v are vectors with v + 0, then the graph of the parametric equation 


(*) =u-+tv, 
y 


where t is a parameter, is a line. 


The vector v is called a direction vector of the line, and we’ll sometimes describe a direction vector of a line 
as being “parallel” to the line. As suggested in our solution to Problem 9.15, the direction vector of a line is not 
unique. In the next problem, we investigate how direction vectors of the same line are related. 


| Problem 9.17: 
| (a) Show that if the graph of e = u + tv contains (p,q), then the graph is the same as the graph of | 
| 


| 3 - a ie 


(b) Let vı and vz be nonzero vectors. Show that if the graphs of G) = ų + sv) and a) = w + tvz both | 


contain distinct points (x4, Ya) and (xp, yp), then the graphs are the same and vı = kv2 for some constant k. § 


Solution for Problem 9.17: 


(a) We defined a line algebraically using a point on the line and a direction. This part asks us to show that no 
matter which point on the line we choose, we'll get the same line. 


Since (p, q) is in the graph of F = u + tv, we have (") = u + tv for some scalar t. Solving for u gives 


u= ) — hv. Substituting this into a = u + fv gives 


O=O- () e= . 


To see that Equation (9.1) has the same graph as 


a = (") + sv, (9.2) 


we note that for any choice of s in (9.2), letting t = s + tı in (9.1) gives the same e Conversely, for any 


choice of t in (9.1), choosing s = t — tı in (9.2) gives the same (7). Therefore, every point in the graph of 
(9.1) is in the graph of (9.2), and vice versa, so the graphs are the same. 


(b) Geometrically speaking, this problem is a fancy way way of saying “two points determine a line.” That 
is, if two lines have two points in common, then the lines must be the same. Let’s show that our algebraic 
definition of a line satisfies this principle. 


We start by using part (a) to note that the graphs of the two given parametric equations are also the 


316 


Oo 
| Problem 9.18: 
(a) Express the equation 2x — 3y = 6 as a parametric equation of the form a =u+tv. 
(b) Show that the graph any equation of the form ax + by = c, where at least one of a and b is nonzero, is 
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graphs of 


G) = e +sv1, (9.3) 
C) = a + tvz, (9.4) 


respectively. Since (xp, yp) is in both graphs and is distinct from (x4, ya), there are nonzero scalars sp and t, 


such that 
G = (Fe) tv 
Yb = Ya bY1, 
Xp\ _ [Xa 
ee F (*) Teona 


Taking the difference of these equations gives s,v1 — tpv2 = 0, so vı = 2y, Letting k = 2 gives us vı = kvp, 
as desired. 


We now must show that graphs of (9.3) and (9.4) are the same. Substituting vı = kv2 into (9.3) gives us 


C) = & + skvo, (9.5) 


where s is a parameter and k is constant. All we have left is showing that this equation has the same graph 


as (9.4). Fortunately, that’s easy. For any choice of t in (9.4), letting s = t in (9.5) produces the same Gi 


Conversely, for any choice of s in (9.5), choosing t = sk in (9.4) produces the same E . Therefore, Equations 
(9.4) and (9.5) have the same graph. 


also the graph of a parametric equation of the form G) =u+tv. 


Show that the graph of any parametric equation of the form G) = u + tv, where v is nonzero, is the 


same as the graph of some equation of the form ax + by = c, where a, b, and c are constants. 


How is the vector v related to the slope of the line a = u + tv? 


Solution for Problem 9.18: 


(a) 


(b) 


Solving for x gives x = $y +3. Letting y = t gives us the parametric equations x = 3t + 3, y = t, or 
aay 
y/ \0 I 


If a + 0, we can use part (a) as a guide. Solving ax + by = c for x gives x = —2y + £. We can take t = y as the 


parameter, and we have 
x\ _ (c/a EA 
(es oo ba 
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If a = 0, we can simply solve for y, since a and b cannot both be 0. We have y = —{x + | = į, and taking 


t = x as the parameter gives 
c/b —a/b : 


(c) Letu= a and v = 2) Then, we have 
uo 02 


x u + tv 
ea i ea) 
y u2 U2 Uy + tv2 
SOX = 11 + tv1, y = Uz + tvz. To produce an equation of the form ax + by = c, we eliminate the parameter t. 
Multiplying x = u + tv; by v2 and multiplying y = uz + tvz by vı gives 
U2X = U10? + LU, V2, 
VIY = U2, + £0102. 
Subtracting the second equation from the first eliminates the parameter t and leaves us with 
O2X — V1 Y = Uy02 — U0}. 


Letting a = v2, b = —v,, and c = uyv2 — u2v1, we see that this equation is in the form ax + by = c. 


Unfortunately, we’re not quite finished. Our work above only tells us that every point in the graph of 


E =u+tv (9.6) 


is in the graph of 
V2X — V1 Y = U1U2 — U01. (9.7) 


It does not tell us that every point in the graph of (9.7) is in the graph of (9.6). Fortunately, we’ve already 
done most of the work to go the other direction. Part (a) tells us that the graph of v2x — viy = 102 — u20 is 
the graph of a parametric equation of the form 


a =w +P’ (9.8) 


for some vectors u’ and v’. We showed in part (b) of Problem 9.17 that if the graphs of two equations 
of this form share two points, then the graphs are the same. We know that every point in the graph of 
(*) = u + tv (and this graph definitely has more than one point because v is nonzero) is in the graph of 


(9.7), and therefore is in the graph of (9.8). Therefore, we conclude that the graphs of (9.6) and (9.8) are the 
same, which means that the graphs of (9.6) and (9.7) are the same. 


(d) Inpart(c), we saw that the parametric equation (5) = u+tv is has the same graph as 02x —V1y = t102 — U0}. 
We then have two cases to consider: 

Case 1: vı = 0. If vı = 0, then vx — vıy = uv — u01 is simply vzx = uv. Dividing by v2 (v2 # 0 

because v + 0), we have x = u1. The graph of this equation is a vertical line with undefined slope. Looking 


back at the equation e = u + tv, we see that when v; = 0, the direction vector is o I which is indeed a 
2 


“vertical” vector. Therefore, a line with undefined slope has a “vertical” direction vector, as expected. 

Case 2: vı + 0. If vı # 0, then we can solve v2x — vy = u102 — U20 for y to get y = a 2 ce +u. The 
slope of this line is 2. So, the slope of a line is the ratio of the vertical component of the direction vector to 
the horizontal component of the direction vector—this ratio is the traditional “rise over run” description of 
slope! 0 
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We have therefore seen that our definition of a line with vectors fits nicely with the “graph of ax + by = c” 
understanding of lines in the Cartesian plane from basic algebra, where the direction vector of a line plays the role 
of “slope.” 


Problem 9.19: The vector av bw, where a and b are scalars, is called a linear combination of v and w. In this 
problem, we explore the graphs of all linear combinations of pairs of vectors. Specifically, we are interested in 


the graph of the parametric equation 
(*) =sv+tw, 
Y 
where both s and t are parameters. 
(a) What is the graph if v = (i) and w = (5) 2 
(b) What is the graph if v = a and w = ( 
(c) What is the graph if v = Gl and w = ( 


Solution for Problem 9.19: 
(a) We have 


6) 12) «= 8). 


) is double the second one, so a = a. gives us x = 2y. Therefore, 


2s + 6t 
s+3t 
we see that the graph is a line through the origin. 


pe 0040-0. 


We can choose s and t to be anything, so the graph is the entire Cartesian plane. 


(c) We have 
COR) 
y) Slo p= t i 
To show that every point (a, b) is in the graph, we let s = a + 2b and t = b in the parametric equation above, 


which gives (x, y) = (a, b). Therefore, once again all possible (x, y) are in the graph, so the graph is the entire 
Cartesian plane. 


The first component in ( 


O 


The graph in part (a) is a line, while the graphs in parts (b) and (c) are the entire Cartesian plane. One key 
difference we notice in the two parts is that v is a scalar multiple of w in part (a), but not in parts (b) or (c). Let’s 
investigate further. 


Problem 9.20: Let v and w be nonzero vectors. Show that the graph of the parametric equation 


x 
( ) =sv+ tw, 
y 


where both s and t are parameters, is a line if v is a scalar multiple of w. 
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Solution for Problem 9.20: If v = kw for some scalar k, then we have 
(G) = sv + tw = ksw + tw = (ks + t)w. 


The graph of this equation is indeed a line with direction vector w. O 
Now, let’s take a closer look at what happens when v is not a multiple of w. 


Problem 9.21: Let v and w be nonzero vectors such that v is not a scalar multiple of w. In this problem, 


| we show that every point (x, y) is in the graph of the parametric equation G) = sv + tw, where s and t are 


parameters. 


(a) What is the graph of o) = v + tw? (Note that the coefficient of v is 1, not s.) 


(b) What is the graph of a = 2v + tw? 


(c) How are your graphs in parts (a) and (b) related? 


} 


(d) Show that every point (x, y) is in the graph of the parametric equation (o) = sv + tw, where s and t are 


parameters. 


Solution for Problem 9.21: Letv = D 
2 


(a) By our definition of a line, the graph is the line through (v1, v2) in the direction of w. 

(b) By our definition of a line, the graph is the line through (2v1, 2v2) in the direction of w. 

(c) The direction vectors of the lines in parts (a) and (b) are the same, so we suspect the two graphs do not 
intersect. The lines intersect at (x, y) if there are constants tı and tz such that v + tw = 2v + tow. Solving for 
v gives v = (fı — t2)w. But we are told that v is not a scalar multiple of w, so this is impossible. Therefore, 
the lines of parts (a) and (b) do not intersect. 

(d) Continuing with our arguments from parts (a) and (b), for each possible value of s, the graph of i = sv+tw 
is a line through (sv, sv2) with direction vector w. Continuing our reasoning from part (c), we see that all 
of these lines are parallel. So, the graph of P = sv + tw is a set of parallel lines. We suspect that any point 


(xo, Yo) is on one of these lines, but we have to prove it. 


We'll start by developing some geometric intuition for why any (xo, yo) y, ew tp 
must be on one of these parallel lines. At right, point P is (xo, yo), line fy is 
the line through the origin with direction v, and line fw is the line through P 


the origin with direction w. To show that P is on one of the set of parallel 


lines that is the graph of o = sv + tw, we draw line fp through P parallel 


: ; 7 , £ 
to fw. Since v is not a constant multiple of w, lines fy and fp are not parallel Q nt 
and must therefore intersect at a point we'll call Q. Since Q is on fy, we 

> 
have G = sv for some scalar s. Since €p || fy, we have QP = tw for some x 


scalar t. Therefore, G) = P=sv+tw. 


While that gives us some geometric intuition for why (xo, yo) must be on one of these parallel lines, we’d 
like our proof to rest on the same definition of a line that we’ve been using throughout this section. In 
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Problem 9.18, we showed that the graph of C) = v + tw is also the graph of the equation 


WX — W1Y = W201 — W102. 
Note that (x, y) = (v1, v2) satisfies this equation, which is expected, since (vı, v2) is on the line. This equation 


describes the line in part (a). In part (b), we considered the line o = 2v + tw. Again applying the result 


from Problem 9.18, this parametric equation can also be written as 


WX — WY = W2(20;) — w1 (2v2) = 2(w2v1 — W102). 


Similarly, for any specific value of s, the parametric equation (*) = sv + tw, where t is a parameter and 


s is fixed, can also be expressed as 
W2X — WLY = W2(sv1) — Wy (Sv2) = s(W2v1 — W102). (9.9) 


We wish to show that for any point (xo, yo), we can choose a value of s such that (xo, yo) is on graph of 
Equation (9.9). We know that (xo, yo) lies on 


W2xX — W1Y = W2X0 — W1Yo, 


since this equation is satisfied by (x, y) = (xo, Yo). So, we choose s = === in Equation (9.9): 


W201 - W102 
W2xX9 — W1Yo 


WX — Wy = s(W2V1 — W102) = 
W201 — W102 


(w01 = W102) = W2X0 — W1Yo- 


Therefore, the point (xo, yo) is indeed on one of the set of parallel lines described by a = sv + tw. Since 


this argument holds for any point (xo, yo), the graph of (*) = sv + tw is the entire Cartesian plane. 


Our argument appears to show that the graph of a = sv + tw is the entire Cartesian plane, but in 


Problem 9.19, we saw that the graph is just a line if v is a constant multiple of w. So, where does our 
argument fail if v = kw? 

The only step in our argument that might fail is when we let s = r What if the denominator, 
W201 — W102, is 0? Then, there’s no such s. If v = kw, then vı = kw; and V2 = kun, so we find that 
W201 — W102 = kwıw — kwıw = 0. This is why the argument fails when v = kw. But is this the only case in 
which wv — W1v2 = 0? 

Fortunately, yes, it is. We write w201 — W102 = 0 as W201 = W102. If wı and wz are nonzero, we can write 
this equation as 7 = 7. Setting these equal to k gives us vı = kw; and v2 = kw2, so v = kw. If wy is 0, then 
w2 + 0, since w # 0. Then, wv = wv tells us that vı = 0 and we again have v = kw for some k, namely 
k= A Similarly, if w2 = 0, we have v = TW 

Putting this all together, we see that w2v; — w1v2 = 0 if and only if v is a multiple of w. Therefore, we 
have w2v) — W v2 + 0 if and only if v is not a multiple of w, and our argument above shows that the graph 


of C) = sv + tw is the entire Cartesian plane if w2v; — w102 # 0. So, the graph is the entire Cartesian plane 


whenever v is not a multiple of w. 


In the last two problems, we investigated the graph of the parametric equation 


a 
( ) =sv+ iw, 
y 
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where s and t are parameters. We found that the shape of the graph depends on whether or not v is a multiple of 
w. This distinction is so important that we have a special name for it. We say that v and ware linearly dependent 
if one is a multiple of the other, and that v and w are linearly independent if neither is a multiple of the other. 


As we continue our study of vectors, we'll want to expand our definition of “linearly dependent” and “linearly 
independent” to describe groups of more than 2 vectors. Here’s how we do so: 


Definition: We say that vı, v2,...,Vn are linearly dependent if there are scalars 4), 42, . . - , An, not all 0, such 
that 
AYV 1 +A9V2 +++ + AyVy, = Q. 


If the only scalars that satisfy this equation are a = a2 = --- = a, = 0, then the vectors are linearly independent. 


To see why this definition is equivalent to our initial definition of “linearly dependent” when we have only 
two vectors, suppose that v and w are nonzero and linearly dependent by our first definition. Then, we must 
have v = kw. Letting a, = 1, a2 = —k gives us av + aw = 0, which shows that v and w are linearly dependent 
by our new definition. If we start instead with the fact that v and w are nonzero and linearly dependent by our 
new definition, then there must be scalars a; and az, not both zero, such that av + azw = 0. We can’t have a; = 0, 
since this would mean aw = 0, but w is nonzero and we cannot have 4; = a = 0. Since a; + 0, we can solve 
mv + mW = 0 for v to find v = —2w, which satisfies our first definition of linearly dependent. 


We also have to make sure the definitions are equivalent if either vector is 0. If v = 0, then w = Ov, so v and w 
are linearly dependent by our first definition. Also, we have 1v + Ow = 0 in this case, so the vectors are linearly 
dependent by our second definition, as well. Similarly, the definitions are equivalent if w = 0. 


Therefore, the two definitions given for two vectors being linearly dependent are equivalent. 


Important: Two vectors are linearly dependent if and only if one is a multiple of the other. | 


Vv Otherwise, they are linearly independent. 


|)__Exercises Ba 


9.3.1 Find an equation of the form ax + by = c that has the same graph as the parametric equation 
(y) = (4) +#(5) 
y) \-1 Cee 
9.3.2 Find a parametric equation of the form 0 = u + tv that has the same graph as 3x — y = 4. 


9.3.3 Find all pairs of linearly dependent vectors in the list oe 7 Coe wy ) j (a 7 a : 


(a) Show that if u and v are linearly independent, then for each vector w, then there is a unique pair of scalars 
a and b such that w = au + bv. 


(b) Show that if there is more than one pair of scalars a and b such that au + bv = w for three given vectors u, 
v, and w, then u and v are linearly dependent. 
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9.4 Projections 


A vector is normal to a line if it is orthogonal to any vector in the direction of that y 
line. So, for example, j is normal to the graph of the line y = 3, the vector i + j is normal £ 
to the graph of x + y = 7, and v is normal to line £ in the diagram at right. 


A The projection of a point onto a line is the foot of the perpendicular from the point 


{N to the line. So, in the diagram at left, P is the projection of point A onto k. 
P k 


We can extend the concept of projection to vectors. To find the projection of v 
onto a nonzero vector a, we draw v from a point O, and then draw a line through 
O with direction vector a. We then let P be the projection of the head of v onto 
this line. We call OP the projection of v onto a. (Note that O does not have to be 


the origin in this definition!) We denote the projection of v onto a as proj,v, so proj, v = OP. 


One reason we draw a line in the direction of a rather than just drawing v 
a or a ray in the direction of a is shown at right. Here, the projection of the 


head of v onto the line is on the opposite side of O from a. Even though OP is O 
in the opposite direction from a, we call OP the projection of v onto a. P 


I] Probiems ig 


Problem 9.22: 
(a) Find a vector that is normal to the graph of 3x + y = 0. 


(b) Find a vector that is normal to the graph of 3x + y + 10 = 0. 


(c) Leta, b, and c be constants such that at least one of a and b is nonzero. Find a vector in terms of a, b, and 
c that is normal the graph of ax + by +c = 0. 


Problem 9.23: 
(a) Show that if n is normal to line £, then kn is normal to line £ for any constant k. 


(b) Show that if nj and nz are nonzero vectors that are normal to the same line f, then n; and nz are linearly 
dependent. 


Problem 9.24: Let v = (4): 
(a) Find proj;v and proj,v. 
(b) Find the projection of v onto 2i. 


(c) Find the projection of v onto —776j. 


i ' = 5 
Problem 9.25: In this problem, we find the projection of v = (t W onto the vector w = ( B) Let 8 be the 


angle between v and w, let O be the origin, and let V and W be points such that V=vand Ŵ =w. 
(a) Let P be the foot of the perpendicular from V to OW. What is OP in terms of ivil, Iwil, and 0? 


(b) Find proj,v. 
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Problem 9.26: Find proj,,v for any two vectors v and w in terms of v: w, |w], and w. 


Problem 9.27: Let a = 4i + 3j and b = —i + 3j, and let p = proj,a. 
(a) Findp. 


(b) Letn = a- p. Findb-n. 


(c) Explain the geometric significance of your result in part (b). 


Problem 9.28: The distance between a point and a line is the length of the shortest possible segment from the 
point to the line. 


(a) Why does the distance between a point and a line equal the distance between the point and the projection 
of the point onto the line? 


(b) Find the distance between (4, —2) and the graph of 3x — 2y — 7 = 0. Hints: 45, 134 


(c) Generalize your results in part (b) by finding an expression for the distance between the point (xo, yo) 
and the line ax + by + c = 0, where a, b, c, Xo, and yo are constants. 


| Problem 9.22: = 


| (a) Find a vector that is normal to the graph of 3x + y = 0. 


| (b) Find a vector that is normal to the graph of 3x + y + 10 = 0. 


(c) Leta, b, and c be constants such that at least one of a and b is nonzero. Find a vector in terms of a, b, and 
c that is normal the graph of ax + by +c = 0. 


Solution for Problem 9.22: 


(a) First, we'll get an intuitive understanding what's going on here by using our 
ideas of “slope” and “perpendicular” from basic algebra. Let the graph of 
3x + y = 0 be line £. Writing the equation as y = —3x, we see that the line 
has slope —3. Therefore, any line perpendicular to has slope 1/3. So, a line 
through the origin perpendicular to must pass through (3,1). Therefore, one 


vector that should be normal to @ is a 3 


This isn’t an entirely satisfying solution; it completely avoids the question 
of why the slope of the line perpendicular to £ is 1/3. Instead of relying on 
“slope” and “perpendicular,” let’s use the definition of a normal vector. A 
normal vector must be orthogonal to any direction vector of the line. So, let’s 
try to find a direction vector of the line. If (x1, y1) and (x2, y2) are two different points on the line, then a 


X2 — X1 


ea: is in the direction of the line. Since (x1, y1) and (x2, y2) are 


vector from one to the other, such as ( 


on the line, we have 


3xı + yı = 0, 
3x2 + Y2 = 0. 


Subtracting the first equation from the second, we have 3(x2 — x1) + 1(y2 — y1) = 0. Writing the left side as a 
dot product, out pops the normal vector: 


E) 
Y= yi ` 
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SS SSS easement 


(b) 


(c) 


SA 
This tells us that @ is orthogonal to any vector between two points in the line, so it is normal to the line. 


Slew : 
The vector o isn’t the only vector normal to £. As we'll investigate in Problem 9.23, any scalar multiple 


2DE 
of (7) is also normal to £, and any vector that is normal to £ must be a scalar multiple of a (including 
negative multiples and 0). 


In part (a), the coefficients of x and y gave us components of a vector normal y 
to the line. Maybe that just happens because the line 3x + y = 0 goes through 
the origin. If we let (x1, y1) and (x2, y2) be points on the line 3x + y + 10 = 0, we 
have 


3xı + yı +10=0, 
3x2 + y2 +10 =0. 


Subtracting the first from the last gives us 3(x2 — x1) + 1(y2 — y1) = 0, and again 


we have 
eee 
. =0. 
1 Y2- 1 
So, the vector G) is orthogonal to any vector between two points on the line, and is therefore normal to 


the line. 
The first two parts give us a guide. We let (x1, y1) and (x2, y2) be on the line, so 


axı +by,+c=0, 
axı + by +c =Q. 


Subtracting the first equation from the second gives a(x2 — x1) + b(y2 — y1) = 0, so 
E 
b y2- 11 
a 


which means that @ is orthogonal to any vector between two points on the line. Therefore, a is normal 
to the line. 


Important: The vector (5) is normal to the graph of ax + by +c =0. 


While we'll be working with normal vectors only in the context of lines and | 
planes in this book, you'll be seeing them again in the context of curves and 
curved surfaces when you study calculus, physics, and other subjects. We say that 
| a vector is normal to a curve at a certain point on the curve if the vector is normal | 
| to the line through that point that is tangent to the curve. In three dimensions, a | 

vector that is normal to a surface at a point is normal to the plane through that | 

point that is tangent to the surface. 


Sidenote: 
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Problem 9.23: 
(a) Show that if n is normal to line £, then kn is normal to line £ for any constant k. 


(b) Show that if nı and nz are nonzero vectors that are normal to the same line £, then n; and nz are linearly 
dependent. 


Solution for Problem 9.23: 


(a) Let d bea direction vector of £, so n- d = 0. We showed in Problem 9.17 that any vector in the direction of 
line € is of the form cd for some scalar c. For any scalar k, we have (kn) (cd) = kc(n- d) = 0, so the vector kn 
is orthogonal to every vector in the direction of £, which means kn is normal to £. 


(b) We'll first think about the geometry of the problem, and then provide an algebraic proof. We can think of a 
vector that is normal to ¢ as being the direction vector of a line perpendicular to £. For any point, there is 
only one line through the point perpendicular to £, and we showed in Problem 9.17 that any two possible 
direction vectors of the same line must be linearly dependent. Moreover, all lines perpendicular to ¢ are 
parallel to each other, so any two possible direction vectors of lines normal to £ are linearly dependent. 


To prove algebraically that nı and nz are linearly dependent, we will show that it is impossible for them 
to be linearly independent. We start by noting that if d is a direction vector of £, then we have n; -d = 0 and 


nz: d = 0. If nm; and n; are linearly independent, the graph of G) = sn; + tn; passes through every point 


(x, y). Specifically, this means that there are some constants a; and az such that d = anı + a2n2. Multiplying 
nı : d = 0 and nz : d = 0 by a and ay, respectively, gives 


ayny -d=0, 
am: -d =0. 


Adding these gives a1nı : d + anz -d = 0, so (anı + an2) : d = 0, which means d- d = 0. But we can’t have 
d - d = 0 because d is nonzero! So, we conclude that n; and nz are linearly dependent. 


Now that we have a thorough understanding of normal vectors, we turn to projections. 


Problem 9.24: Letv = ak 


(a) Find the projections of v onto i and j. 
(b) Find the projection of v onto 2i. 
(c) Find the projection of v onto —776j. 


Solution for Problem 9.24: 


(a) To find the projections of v onto i and j, we draw v from the origin to (3,4). When i is 
drawn from the origin, it lies along the x-axis. Therefore, we project (3, 4) onto the x-axis 
to find the projection of v onto i. As shown in the diagram at right, the projection of 


(3,4) onto the x-axis is (3, 0), so the desired projection of v onto i is 0}: Similarly, the 


projection of (3, 4) onto the y-axis is (0, 4), so the projection of v onto j is Al 
(b) Just as with i, the vector 2i lies along the x-axis when drawn from the origin. So, the projection of v onto 2i 


is the same as the projection of v onto i, which is o : 
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When v is projected onto vector a, the magnitude of a is irrelevant; only its 
Q= direction affects the puec ton 


(c) Only the direction of —776j affects the projection. When —776j is drawn from the origin, it lies along the 


y-axis, so the projection of v onto —776j is the same as the projection of v onto j, which is ( 4): 


Problem 9.25: Find the projection of v = f J onto the vector w = ( 35) : 


Solution for Problem 9.25: First, we draw v and w from the origin, and we draw the line 
k containing w. Then, letting V be the head of v, we project V onto k, which gives us 
point P as shown. Our goal is to find P. We know the direction of P, since it is the 
same as that of w. So, we only have to find the magnitude of P. We can do so with 
right triangle VOP, where O is the origin. In this triangle, we have cos ZVOP = or. But 

= |Iv|| = 8, so cos ZVOP = 2. If we can find cos ZVOP, then we can find OP. Since 
cos ZVOP is the cosine of an angle between vectors v and w, this sounds like a job for 
the dot product. 


We have 


veiw _ (4-V3)(1) + (4)(v3) _ V3 


IIvll Iwi (8)(2) 2 


Therefore, we have OP = 8cos ZVOP = 4 V3. Since P is a vector in the same direction as w but with magnitude 


4 V3, we find P by multiplying the unit vector in the direction of w by the magnitude of P. The unit vector in the 
direction of w is so we have 


cos ZVOP = 


Ewa 
liwli” 


Eo Sas. > =2vV3w= (? a 


Oo 


| Problem 9.26: Find proj,,v for any two vectors v and w in terms of v - w, ||w||, and w. j 


Solution for Problem 9.26: Let @ be the angle between v and w. We’ll handle the cases of acute and obtuse 0 
separately, and then address the special cases in which @ is 0, 5, or 7. In all cases, we draw v and w from a point 
O to points V and W, respectively, and let 0 be the angle between these vectors. We then let P be the projection of 


V onto line Ow : 


Case 1: @ is acute. We proceed as in Problem 9.25. We find OP by first 
finding oP , and then scaling w by the appropriate factor. From right 


triangle VOP, we have ||OP|| = OV cos ZVOP = |Įvl|cos8. Since @ is the 
angle between v and w, we have 


Vw 


llv iiw Io wll? 


To find OP, we scale the unit vector in the direction of w by ||OP||. From Problem 9.2, the unit vector in the 
direction of w is so 


|| OP || = Ilviicos @ = Ivii ——_ 


Ewe 
jiwi” 
e = a 


roy. v= OP = — = —~ 
PrOw iwi jie 
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Case 2: @ is obtuse. We proceed as before, but now O is between W and P, V 
as shown. So, = OV cos ZVOP = |lv|| cos(z — 8) = —||vllcos 8. We then z y 
have 5 zA W W z 
OP|| = -Ilvl|cos 8 = - Iivi = = -7r 
IoP] iT ill = Iwl] P 


Note that this result is positive, because v - w is negative when the angle between v and w is obtuse. To find OP, 


we multiply the unit vector in the direction of w by —||OP||, where the negative sign accounts for the fact that OP 
and w are in opposite directions. We therefore have 


Vw 


proj,,V = Cie -||OP los IIw| = wie 


This is the same expression as when @ is acute. 


Case 3: 0 = O or 0 = n. In this case, we have v = kw for some scalar k, and we should have proj, v = v, because 
v is already in the same (or opposite) direction as w. The expression above for proj,,v gives us 


V-W kw (w) ( ) k |}wlP z ) 
zW= 7v] = z ae) =v, 
Ilwll [|wll k Iwl \k 


Case 4: 0 = 4. If v and w are orthogonal, then proj,,v = 0, since P will be the same point as O. The expression 
we derived for proj, V in the other cases delivers the correct answer in this case, too, because v : w = 0 when v and 
w are orthogonal. 


as expected. 


Since we’ve covered all the possible cases, we have shown that: 


Important: For any vector v and any nonzero vector w, we have 


Projy.V = 


Vw 
Ta zW 
[Iw 


We can test this formula for proj,,v by confirming our result in Problem 9.25. Letting v = ee) and 


w= (jz) -wehave 
veiw (4730) + (4)(V3) / 1) _ /273 
pia = Rew na (a) = 2) 


which matches our answer from earlier. 

Problem 9.27: Let a = 4i + 3j and b = —i + 3j, and let p = proj, a. 
(a) Find p. 

(b) Letn=a-p. Findb-n. 

(c) Explain the geometric significance of your result in part (b). 


Solution for Problem 9.27: 


a- abi (4(-1) + OG, E a 


PP (caer To M oa 


(a) We have proj,a = 
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(b) We have n = (4i + 3j) - (-4i+ 3j) = 3i+ ĉj, and 


TE 9 3 
-b= 2i = -(-j 1) = ofl — = 
n (i+ 5j) (—i + 3j) J ( Uts 3=0. 


Therefore, n is orthogonal to b. 


(c) Writing n = a- p as a = p + n tells the story. Suppose we draw a and b from the b 
origin. Because p is the projection of a onto b, the vector from the head of p to the 
head of a must be normal to the line that contains p. If we let this normal vector be a 
n, we must therefore have a = p + n and n- p = 0. Since b and p are in the same or P 
opposite directions, we must also have n - b = 0. 

Another way to look at this is that n is the projection of a onto a vector orthogonal 
to b. This is depicted in the diagram at right. Therefore, we see that a is the sum of 
the projections of a onto b and onto a nonzero vector orthogonal to b. 


O 


There’s nothing special about the particular vectors a and b in Problem 9.27. We can make the same observations 
for any nonzero vectors a and b. 


You'll find a lot more use for projections and for expressing a vector as a sum of two orthogonal vectors when 
you study physics. We’ll finish this section with an application to analytic geometry. 


Problem 9.28: The distance between a point and a line is the length of the shortest possible segment from the 

point to the line. 

(a) Why does the distance between a point and a line equal the distance between the point and the projection 
of the point onto the line? 

(b) Find the distance between (4, —2) and the graph of 3x — 2y -7 = 0. 


(c) Generalize your results in part (b) by finding an expression for the distance between the point (xo, Yo) 
and the line ax + by + c = 0, where a, b, c, Xo, and yo are constants. 


Solution for Problem 9.28: 


(a) Let P be the point and £ be the line. Let B be the projection of P onto ¢ and let A 
be any other point on £. The Pythagorean Theorem then gives us PB? + AB? = PA?. 
Therefore, we have PA > PB. So, P is closer to B than to any other point on £. 


(b) Let the point be P and the line be £. The shortest distance from P to £ 
is PB, where B is on £ such that PB L £. Unfortunately, it’s not clear 
from our diagram what the coordinates of B are. We could find those 
coordinates by finding the equation of the line through P perpendicular 
to £, and then solving a system of equations. Then, we’d have to use the 
distance formula, and our diagram makes it obvious that we'll be dealing 
with a lot of fractions. That looks like a lot of work. Maybe we can find 
a better way. 


Point B is the projection of P onto £, and the vector from B to P 
is normal to £. We know the direction of this vector! From our work in 
Problem 9.22, we know that 3i—2j is normal to the graph of 3x—2y—7 = 0. 
However, we don’t know the magnitude of BP because we don’t know 
exactly where B is. If only point B were a convenient point, then this problem would be a lot easier. 


Let’s take a look at a convenient point on £, such as (5,4), which we'll call point A. This point is on £? 
because (x, y) = (5,4) satisfies 3x - 2y — 7 = 0. How is this convenient point related to the desired point? 
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Triangle PAB is a right triangle. Therefore, not only is B the projection of P onto AB, but B is also the 
projection of A onto BP. Aha! We know the direction of BP—the vector normal to £ is in this direction. So, 
we can find ||BP|| by projecting AP onto the normal vector! 


In the diagram, we have drawn the vector n from A normal to f, so n = 3i — 2j. The projection of 


AP onto this vector, AD, has magnitude equal to BP, since BPDA is a rectangle. We have AP=P-A= 
(4i — 2j) — (51 + 4j) = —i — 6j, and we're ready to find the desired distance: 


= 


_|aP-n E-a] in- (EOD n 2 vis 
inl In| (2+) 13 


(c) We use the previous part as a guide. Let P be (xo, yo) and let A be a point (x, y) on the graph of ax +by+c = 0. 
The vector normal to the graph is n = ai + bj, and we seek the magnitude of the projection of AP onto n. 
We have AP = (xo — x)i + (yo — y)j, so the desired distance is 


AP - n || — |(&o= adi + Go - yi) ai + bi) | ae (xo — x)a + (yo — ue foe laxo + byo — ax — byl 
linli? Iinl|? [[n||? Vaz + b2 


Since (x, y) is on the graph of ax + by + c = 0, we have —ax — by = c, so our desired formula for the distance 
between (xo, yo) and ax + by +c = Ois 
laxo + byo + cl 


Va? + b2 


Important: The distance between the point (xo, yo) and the graph of ax + by + c = 0 is 


laxo SF byo + c| 


Va? + b? 


As one quick check, we can note that if (xo, yo) is on the graph of ax + by + c = 0, then axo + byo + c = 0, and our 
formula for the distance between (xo, yo) and the line correctly gives 0. 


WARNING!! Note that when using the formula 


laxo aF byo + c| 
Va? + b? 


to find the distance between (xo, yo) and ax + by + c = 0, all of the terms in 
the equation for the line are on the same side. If we have an equation such 
as 2x + 5y = 8, then we do not have a = 2, b = 5 and c = 8. We move the 8 to 
the other side and have 2x + 5y — 8 = 0, so a = 2, b = 5, and c =—8. 


[Exercises ie 


9.4.1 Find proj,u for each of the following: 


a wee) o u= (Fv) 
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9.4.2 Suppose proj,u = 9i — 3j. 


(a) If w = 3v, then what is proj,,u? 


(b) Ifx = 3u, then what is proj, x? 


(a) Find an equation of the form ax + by + c = 0 whose graph is a line through (3, 4) such that 5i — 3j is normal 
to the line. 


(b) Find a vector with magnitude 5 that is normal to the graph of 2x + 4y = 7. 
9.4.4 Show that if proj,u = proj,v for all vectors x, then u = v. 
9.4.5 Letv be a nonzero vector. 

(a) If proj,w = proj,(—w), then must w = 0? 

(b) If proj,w = proj._,w, then must w = 0? 
9.4.6 Letu be the unit vector xi + yj, and let u+ = —yi + xj. 


(a) Find the angle between u and ut. 


(b) Show that for any vector a, we have a = proj,a + proj,.a. Explain the result both geometrically and 
algebraically. 


(c) Show that |lal|? = (a- u)? + (a- u)?. 
9.4.7 Let 0 be the angle between u and v, where u- v # 0. Find 


u:v 
(proj,u) . (proj,,V) 


in terms of 8. 


9.5 Summary 


x 
A vector in two dimensions is an ordered pair of numbers, which we can denote as G) . The numbers x and y are 
called components of the vector. We often refer to real numbers as scalars when speaking of vectors. The sum of 


of two vectors is given by (5) + Gi = « i a , and the product of a scalar and a vector is given by k 6) = ia ; 


We denote the vector from the origin to a point C as Č. We refer to the vector from point A to point B as AB, 
and we have AB = B- A. 


x e i 
A vector with magnitude 1 is called a unit vector. The vector C) can also be written as xi + yj, where i is the 


unit vector (5) and j is the unit vector (i) . The zero vector, which is written as 0, is the vector in which both 


components are zero. We use the symbol R? for the set of all vectors in two dimensions. 


The norm of a vector v = G) is ||v|| = 4/x2 + 42. The norm of a vector is also called the magnitude or length 


of the vector, since it measures the length of the geometric representation of the vector. 
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Wy 


We define the dot product v - w as a i ( 
02 w2 


) = vw + V2W2. The dot product has the following properties: 


e u-v=v-u 
e u- (av) =a(u-v) 

eu-(v+w)=u:v+u-w 

eveve live 

e v- w= |ivli|iw]| cos 8, where @ is the angle between v and w 


We say that a function f with domain R? is a linear function if f(av) = af (v) for all scalars a and vectors v, and 
f(v + w) = f(v) + f(w) for all vectors v and w. 


Each linear function g that maps vectors to scalars can be defined as g(w) = v-w 


‘Important: 
3 for a unique vector v. 


If u and v are vectors with v + 0, then the graph of the parametric equation @ =u + tv, isa line. 


Definition: We say that vı, v2,...,V, are linearly dependent if there are scalars a1, 42, . . . , 4n, not all 0, such 
that 
A4Vt + A2V2 + ++ + 4yVn = O. 


If the only scalars that satisfy this equation are 4, = a2 = + +: = a, = 0, then the vectors are linearly independent. 


Two vectors are linearly dependent if and only if one isa multiple of the other. 
Otherwise, they are linearly independent. 


Important: 


Vv 


ee a a 
A vector is normal to a line if it is orthogonal to any vector in the direction of that line. The vector a is 


normal to the graph of ax + by + c = 0. 


The projection of a point onto a line is the foot of the perpendicular from the v 
point to the line. To find the projection of v onto a nonzero vector a, we draw v 
from a point O, and then draw a line through O with direction vector a. We then OA 
let P be the projection of the head of v onto this line. We call OP the projection of P 
v onto a. (Note that O does not have to be the origin in this definition!) We denote the projection of v onto a as 


proja V, SO proj, V = OP. 


Important: For any vector v and any nonzero vector w, we have 


, ve 
proj V = ~— zW. 
adisi a 
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Important: The distance between the point (xo, yo) and the graph of ax + by +c = O is 


laxo + byo + cl 


Va? + b? 


REVIEW PROBLEMS 


9.29 Evaluate iga 


9.30 Find k such that 4i — 5j and —3i + kj are linearly dependent. 


and ||15i — 8j||. 


9.31 Find a vector v such that ||v|| = 10 and v is in the same direction as 4i — 6j. 
9.32 Let A, B, and C be points in the Cartesian plane. Find all possible values of AB + BC + CA 


9.33 Points A, B, and C are in the Cartesian plane such that C= 1A + B). How are A, B, and C related 
geometrically? 


9.34 Show that |lv|| + Iwl] = [lv + wll. What geometric result is this known as? 


9.35 Evaluate each of the following 
i ry 27 m 
@ Gi-2))-8i oaa. 
9.36 How are u-v, ||ul|, and ||v|| related if u and v have the same direction? What if they have opposite directions? 


9.37 


(a) Suppose a is a nonzero vector and b and c are vectors such that a: b = a - c = 0. Must b and c be linearly 
dependent? 


(b) Suppose a, b, and care nonzero vectors such that a-b = Oand a-c # 0. Must b and c be linearly independent? 


9.38 Letu = 3i + 2j. Suppose each vector v such that u - v = 9 is drawn from the origin in the Cartesian plane. 
How are the heads of all these vectors related? 


9.39 Show that ||v + w]| = ||v — wl| if and only if v and w are orthogonal. 


9.40 


(a) If two vectors have the same direction, but different magnitudes, then must one vector be a scalar multiple 
of the other? 


(b) If one vector is a scalar multiple of the other, then must they have the same direction? 


9.41 If u-v is nonzero, can we tell just from this dot product whether the angle between u and v is acute or 
obtuse? 


9.42 Find proj,,v for each of the following: 


o a) © eG) 


9.43 If proj,u = u, then must u and v be linearly dependent? 
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9.44 Find both an algebraic and a geometric proof that ||proj, all < llall. 


Ml 


9.45 Prove that any three vectors u, v, and w are linearly dependent. 


_Challenge Problems 


9.46 How many vectors in two dimensions have one component equal to 2 and have magnitude 6? How are 
these vectors related geometrically? 


9.47 Suppose that u- v = 2 and v- w = —3. What are the possible values of u - w? 
9.48 Suppose a = 3i —j. Hints: 98 
(a) Explain why more than one vector b satisfies proj,b = 6i — 2j. Find two such vectors. 


(b) Suppose we also know that ||b|| = 10. Then, how many different vectors b satisfy proj,b = 6i — 2j? 
(c) Suppose that proj;b = —j. Then, how many different vectors b satisfy proj,b = 6i — 2j? 


9.49 Show that if u and v are linearly independent, then the values of proj,,r and proj,r uniquely determine r. 
In other words, if we are given values for proj,,r and proj,r, then there is always only one possible vector r that 
produces the given values of proj, and proj,r. Hints: 97 


9.50 If u,v, and w are nonzero vectors such that u = proj,u + proj,,u, then must we have v - w = 0? Hints: 46 


9.51x Find the pair of equations whose graphs are the two lines that bisect the angles formed by the graphs of 
the lines 4x — 3y = 5 and x —- 2y + 7 = 0. Hints: 220 
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If people do not believe that mathematics is simple, it is only because they do not realize how complicated life is. 


—John von Neumann 


CHAPTER 1 0 


T E in Two Dimensions 


10.1 What is a Matrix? 


Consider the system of equations 


2x +5y=1, 
—3x — 4y = —5. 


Each component on the left side is a linear combination of x and y. We have a special notation for expressing such 


a vector; we can express it as multiplication of a by a tool we call a matrix. Specifically, we can write 


Gey) Sea 


where ie a is a matrix. (The plural of matrix is matrices.) 


A matrix can be viewed as essentially a shorthand for keeping track of the coefficients of the variables in a 
system of linear equations. We say that the matrix above is a “2 x 2” matrix because it has 2 rows and 2 columns. 
In this section, we'll investigate some properties of 2 x 2 matrices, starting from the definition 


(ea) = (Sra) 


ee IIU IUU 
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Note that in this definition, the product of a matrix and a vector is a vector whose components are linear 
combinations of the components of the original vector. 


More formally, a 2 x 2 matrix represents a function that has all vectors in two dimensions as its domain, and 
such that each output vector has components that are linear combinations of the input vector’s components. The 


! f l METY. . fa 
entries of the matrix describe how these linear combinations are formed, so the matrix a J represents the 


ax + by 


function that outputs a + dy 


) for each input (*) . We express this function as the matrix multiplication 


i 3 (*) e (z $ i 
c d) \y)  \cx+dy 
rather than with the usual function notation because the matrix tells us exactly how the output is determined for 


each input. 


We will sometimes refer to a function that has vectors in two dimensions as inputs as having domain R?, and 
we'll say that a function “maps vectors to vectors” if it has vectors both as inputs and as outputs. 


We typically use a bold capital letter to denote a matrix, and sometimes use the corresponding lowercase letter 
with subscripts to represent the entries in a matrix, like this: 


Me & ee) 
Az, 422) ` 
For each entry a;;, the first number in the subscript indicates the row of the entry and the second number indicates 


the column, so 4, is the first entry of the second row. Also, just as we let 0 be the vector with both components 
equal to 0, we denote the matrix with all entries equal to 0 as 0, and call this the zero matrix. 


i _Problems > 


Problem 10.1: Compute the following products: 
Sue | —2 
@ (6 3)(s) 
Problem 10.2: How are Ai and Aj related to A? 


Problem 10.3: We’d like to define multiplying a matrix by a scalar such that (cA)v = c(Av). How should we do 
Ai 412 
A21) 422 


Problem 10.4: Show that for any matrix A and any vectors v and w, we have A(v + w) = Av + Aw. 


Problem 10.5: Let A = i: ir B= E ah and Vv = les 


so? That is, ifc is a scalar and A = ( J then what matrix equals cA? 


5 -2 1 
(a) Find Av and Bv in terms of x and y. 
(b) Find the matrix C such that Cv = Av + Bv for all v. 


Let P = iG es a and Q = (a a Suppose we define P + Q such that (P + Q)v = Pv + Qv for all 


pa pz q21 q2 
matrices P and Q and all vectors v. What matrix equals P + Q? 


10.1. WHAT IS A MATRIX? 
We start with some practice multiplying vectors by matrices. 
Problem 10.1: Compute the following products: 
3 —1\ /-2 0 = Æ 
o G) o (5 D) 
Solution for Problem 10.1: We apply our definition of multiplication of a vector by a matrix: 
C „l 6) _ (neu 
Cd) \ \cx+dy/` 
E C za = _ Ce $ ye) _ ea 
6 8 5 (6)(-2) + Bs) Baws e 


E ey len 


Multiplication of a vector in two dimensions by a 2 x 2 matrix is only defined 
with the matrix on the left. We have 


but 


GC a) ) 
y/ \c d 
is not defined. — 
(Problem 10.2: How are Ai and Aj related to A? | 


Solution for Problem 10.2: We have 


eD- m DOG 
an è an/ \0 az azn ā an/ \l Aa} 


WARNING! 
aa 


Therefore, we see that: 


omma Ai is the first column of A and Aj is the second column of A. = 
| VA SaD dkk 


You shouldn’t have to memorize this—it’s a straightforward application of our definition of the product of a 
matrix and a vector. However, sometimes this is a very useful way of thinking about matrices: the first column of 


A is Ai and the second column is Aj. 0 


Problem 10.3: We'd like to define multiplying a matrix by a scalar such that (cA)v = c(Av). How should we 
Ay 
a21 


a i 
do so? In other words, if c is a scalar and A = ( fe , then what matrix equals cA? 
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Solution for Problem 10.3: Just as with matrix addition, multiplying a matrix by a scalar works in the intuitive way, 
by multiplying each entry in the matrix by the scalar. We see this by noting that 


6 (a (*)) a (é a (*)) as Ge = aoe) 2 ee a (*) 
y > az, ar y E a21X + d22y E A21CX + Agcy Caz; CA2 y 4 
Ae (a =) 
Caz; CA22 


With this definition of multiplication of a matrix by a scalar, we can also show that A(cv) = (cA)v = c(Av). 


so setting 


gives us (cA)v = c(Av), as desired. O 


‘Important: For any scalar c, vector v and matrix A, we have 


__(cA)y = A(cv) = c(Av). 


ieai 


Problem 10.4: Show that for any matrix A and any vectors v and w, we have A(v + w) = Av + Aw. 


Solution for Problem 10.4: Writing out the details of A(v + w), we have: 


a a v w 
A21) A22 02 W2 
oy & a i) 
A2 Ay 02 + W2 
_ (41101 + 441 W + 41202 + A12W2 
A2101 + A21W 1 + 42202 + AW 


= (me + 41202 ay1W, + 3 
42101 + A2202 A21Ww1 + Aa22W2 


=n ra) (e 5 Giese 


Therefore, we have A(v + w) = Av + Aw. 0 


Problems 10.3 and 10.4 together tell us that the function that a matrix represents is a linear function. Letting 
f be one such function, this means that f(av) = af (v) for all scalars a and vectors v, and f(v + w) = f(v + w) for 
all vectors v and w. As we'll show in Problem 10.19, any linear function that maps a vector to a vector can be 
represented as multiplication by some matrix. Therefore, studying the basic properties of matrices is equivalent 
to studying linear functions that map vectors to vectors. Since linear functions are part of many fields of study, 
matrices are an essential mathematical tool with many applications. 


eS 4 -3 xX 
Problem 10.5: Let A = E SAN B= 2 0 i and v = Cr 
(a) Find Av and Bv in terms of x and y. 


(b) Find the matrix C such that Cv = Av + Bv for all v. 


(c) LetP= ie a and Q = (= a Suppose we define P + Q such that (P + Q)v = Pv + Qv for all 


pa Pz 921 42 
matrices P and Q and all vectors v. What matrix equals P + Q? 


338 


10.1. WHAT IS A MATRIX? 


Solution for Problem 10.5: 


(a) We have 
ae (5 =) a 7 (sy) eal & a) G) 7 oan 


; +3 4x-3 
(b) Since Av + Bv = eo +( A = (3 OY), the matrix C such that Cv = Av + Bv for all v is 
y =% 4x — 2y 


ash mp 


(c) We use our first two parts as a guide: 


P()+ () A eee EG) 


pux + re Ge: d m 
Pax + pooy 921X + qooy 


E + pı2Y + q11X + ae 


pox + pzy + gax + gooy 


(pu +qu)x + (pi2 + T 
(Par + g21)x + (p22 + q22)y 
pu tgu pit a @ 
pa tqa pr2+922/ \y/- 


So, we see that we should define P + Q as 


P+Q- @ a n @ ie) = A +411 p12 i 
p21 p22 921 422 P21 +421 Pz + G22 


[m] 


We therefore see that adding matrices works just like adding vectors: we add two matrices by adding the 
corresponding entries in the two matrices. 


|| __txercises Ea 
10.1.1 Find the following products: 
3 -4 3 
ey o P) o 
-1 0 -1 
(b) ( 4 a) ( 3 ) 
10.1.2 We define the difference of two matrices A — B to be the matrix D such that D + B = A. Show that if 


A= a and B = E biz a then 
azn ar bo, bn)’ 


A-B= a -bu a2 — c] 


an — bz, an -—bz/ ` 
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6 -4 
10.1.3 Let A = (3 BF 


(a) Find A ( “,) and a (3). 
=9 —1 
2 on ee 
(b) Are a and e linearly dependent? 
(c) Are your results in part (a) linearly dependent? 
(d) If your answers to (b) and (c) are the same, then solve those parts again. If they are different, is it just a 


coincidence? If you choose any two vectors to replace ea and (zy) in part (a), will your answer to part 


(c) still be the same? 


10.2 Multiplying Matrices 


In the previous section, we found that matrix addition works in the way we might expect: each entry in the sum 
of two matrices is the sum of the corresponding entries in the two matrices. In this section, we learn how to find 
the product of two matrices, and find that the process isn’t nearly as straightforward as adding two matrices. 


I Ero eae - 


Problem 10.6: Consider the system of equations 


Ax+3y=7, 
2x —9y = 5. 


Suppose we wish to write the system in terms of the variables u and v, where x = 2u — v and y = 3u + v. 
(a) Write the original system in the form A a = 


(b) For what matrix B do we have P =B G 


(c) Use substitution to write the original system of equations in terms of u and v rather than x and y. 


Suppose we define the matrix product AB so that for any vector w, we have (AB)w = A(Bw). What 
is AB? 


Problem 10.7: Find a matrix that equals AB, where 
he ie a ie A = es 
an a2 bor bz 
Problem 10.8: Perform the following matrix multiplications: 
E 2 ) k A 
ORE o ales 


Gale AeA) 
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Problem 10.9: Suppose A and B are 2 x 2 matrices. 


(a) Must we have A + B = B + A? 
(b) Must we have AB = BA? 


Problem 10.10: 
(a) Is matrix multiplication associative? In other words, is it always true that A(BC) = (AB)C? 


(b) Is matrix multiplication distributive over addition? That is, is it always true that A(B + C) = AB + AC 
and (B + C)A = BA + CA? 


Problem 10.11: 


(a) IfA G) = ray then must A = ié le 
(b) IfA G) = o and A a = (78°) then must A = F 4) 


() IfA o = A and A ( si = (|), then must A = i 1)? 


Problem 10.12: Suppose v and w are linearly independent and A and B are matrices such that Av = Bv and 
Aw = Bw. 


(a) Why must Ai = Bi? 
(b) Show that A = B. 


Problem 10.6: Consider the system of equations 


4x + 3y =7, 
249 — 5. 


Suppose we wish to write the system in terms of the variables u and v, where x = 2u — v and y = 3u +v. 


(a) Write the original system in the form A (3 


For what matrix B do we have o =B a ? 


Use substitution to write the original system of equations in terms of u and v rather than x and y. 


Suppose we define the matrix product AB so that for any vector w, we have (AB)w = A(Bw). What 
is AB? i 


Solution for Problem 10.6: 


(a) We have C =) C) = a soA = G A 


(b) We have Y = 6 z (“), 508 = G al 


(c) Substituting x = 2u—v and y = 3u+v into the original system of equations, and then doing some simplifying, 
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gives the system 


17u -—v =7, 
=23m = llo = 5, 


Iy =1) (x 7 
We can express this system as IE Ba (*) = (7). 


(d) Substituting C) = G ce C) from part (b) into G D) G) = (7) from part (a) gives 


G “) (3 7) (2) =). 


Comparing this to part (c), it appears that we should define matrix multiplication such that 


C a G D a (Ss a 


Notice that this is not what simple intuition might suggest. 


We do not multiply two 2 x 2 matrices by multiplying corresponding entries. 


WARNING!! 
ae 
For example, 
(3 mie e er 
2 Oe i 56 were) Wh as 


Let’s see how matrix multiplication does work, given that we wish to define matrix multiplication so that 
(AB) w = A (Bw) for all w. 


Problem 10.7: Find a matrix that equals AB, where 


Ne ie ol seg B= ine rA l 


anı a2 ba bz 


Solution for Problem 10.7: We let w = (s) , and we have 


_ by a) (“)) 
Beanies i bo.) \v 
af ea a) (Ge + A 
an = a2/ \bnu + bo20 
= Co + aybi + abau + oe 
azıb11u + A21b420 + Ano bz u ap Aro b220 
a o + anba )u + (aiibi2 + a 
(421041 + a22b21)u + (a21b12 + a22b22 V 


a cae + a2b1 ayubi + te) a 
anbi + A22b21 a21b12 + a22b22/ \v 


ayıb + a42b2, anbi + anb 
Therefore, we have AB = ( 11911 T 412921 411912 + 412022 \ 


axb11 + A22b2, 4a21b12 + Arb 
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Important: 


We define the product of two 2 x 2 matrices as -E 


ae ) K A = Ge +anbza anbi ee) | 


an 22) \ba ba anbi + Az2b2, An b12 + Argbo2 


We can remember how to multiply matrices by viewing each column of a matrix and each row of a matrix as 


by b 
a vector. The column vectors of ( a 2 are ca and ear Similarly, the two rows of a matrix A are the 
by, bz bz bn 


! : ; an a 
row vectors of A. We typically write row vectors horizontally, so the row vectors of ie i) are (a a2) and 
21 422 
(an 2). 


With this in mind, we can remember how to perform matrix multiplication by noting that the entry of row 
i and column j of AB is the dot product of row i of A and column j of B. For example, the entry in row 1 and 
column 2 of AB is 


b 
(an a12) . Ce) = Aanby + Ay2b2. 


If we let the rows of A be the vectors a; and az, and the columns of B be the vectors b; and bz, we can therefore 
express the product AB as 
a _ [a;b ee 
oe ( a2 ) (b: ba) . a az: b2/ ` 


We can also remember this matrix product by noting that each column of AB equals A times the corresponding 
column of B. That is, we can view the product as: 


AB=A (b: bz) = (Ab, Ab») . 


[Problem 10.8: Perform the following matrix multiplications: a 


| 


o (% to) s) o  3)(F 3) 


B Problem 10.8: (2 l pea (-3 2) G (3 2) e 
-6 -10/ \-1 -5 (-6 -10) , A (-6 -10) ; e 
= (—3)(0) + 2C)  (-3)0) + 2X(-5) ) 
(—6)(0) + (-10)(-1)_ (-6)(1) + (-10)(-5) 
=(i9 a): 
P GIG 2) = (@casoay w00) l ia) 


o 
Viewing matrices as representing functions, the matrix product AB is a composition of functions. The product 


A (B 4) tells us to apply the function that B represents to o , and then apply the function that A represents 
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to the result of B o . As we have just seen, we can represent the product AB as a single matrix, and this matrix 
x 
represents the composition of the functions A and B represent. So, the product (AB) (*) tells us to apply the 


i : re x 
result of this function composition to G ; 


Problem 10.9: Suppose A and B are 2 x 2 matrices. 


(a) Must we have A +B = B + A? 
(b) Must we have AB = BA? 


Solution for Problem 10.9: 
(a) Yes. We have 


A+B= S a) r & A i a +by a2 + a 


A21) 422 ba ba an + by an + bz 
= bu +411 biz + 412 = bu biz 4414 412\ _ 
= = + =B+A. 
bz +21 bn +az by bn az, 422 


(b) The non-intuitive definition of matrix multiplication might make us suspect that AB is not necessarily equal 
to BA. We can test it by looking at one of the examples in the previous problem. For example, we found 


that 
G3) 3)= Gs a) 
4 3/\1 -2/ 7 \-13 14/° 
Reversing the order of the matrices in the product, we have 


G DG JEER WD- G 2) 


So, on our first try we found a pair of matrices A and B such that AB and BA are not equal. 


 WARNING!! Matrix multiplication is not commutative; the products AB and BA are not 
E necessarily equal. 


Sidenote: When we think of a matrix as representing a function, it’s not surprising that 


matrix multiplication is not commutative. The product AB (*) =A (B (3) 


tells us to apply the function that B represents to (*) , and then apply the function 


that A represents to the result of B (*). The product BA E =B (a (*)) 


has us apply these functions in the reverse order. But we saw on page 17 that 
function composition is not commutative; if f and g are functions, then f o g is not 
necessarily the same as go f. 


` 


344 


10.2. MULTIPLYING MATRICES 


| Problem 10.10: 
(a) Is matrix multiplication associative? In other words, is it always true that A(BC) = (AB)C? 


(b) Is matrix multiplication distributive over addition? That is, is it always true that A(B + C) = AB + AC 
and (B + C)A = BA + CA? 


Solution for Problem 10.10: 
(a) We have 


A(BC) = a o o + by2¢2, buci + aaa 
A, 22) \bacu +bzc2 baci + bo2e2 
= eo +aybizca +anbacy +apbzca anbync + a1b12022 + a12b21012 + fe 
7441011 + Ay1b12C21 + Agabo1C11 + A2Qb22Co1 a21b11612 + Aor b12C22 + Azabo1C12 + A22b22C22 


and 


(AB)C = ee +apbn b12 + ate) a a 
anbu +anbz arbi + a22b2/) \co Cx 
a (ey + Ayob71C11 + a11b12C21 + A32b22C21 11 b11C12 + 412b21C12 + A11b12C2 + perc) 
A21b41.C11 + Az2bo1C11 + a21b12C21 + Aogbe2¢21 a21b11C12 + a22b21C12 + ao1b12C22 + A22b22C22/ ` 


With a little tedious inspection, we can see that these expressions for A(BC) and (AB)C are indeed the same. 
Therefore, we can simply write ABC rather than writing A(BC) or (AB)C. 


Our proof that A(BC) = (AB)C is not particularly illuminating. We can find a more intuitively satisfying 

explanation by starting with systems of equations: 

fy = Ayu + ay2U2, Uy = buv + bizva, Vy = CW, + Cw, 

ty = Any + A222, Uz = buv + bz, Vz = C1 W1 + C2W2. 
We can view these systems as t = Au, u = Bv, and v = Cw. Through a pair of substitutions, we can 
combine these systems to express tı and f in terms of w; and wz. No matter which order we perform these 
substitutions, we'll get the same system of equations in the end. If we substitute the expressions for u and 
uz from the second system into the first, and then substitute the expressions for vı and vz into the result, we 
get t = (AB)(Cw). If we instead first substitute the expressions for vı and v2 from the the third system into 
the second, and then substitute the result into the first, we get t = A(BCw). Either way, we get the same 
system. 


(b) This one’s a good deal easier. We have 
_ fan aņ)\/bu+cu b+ a 
MB a Che e a S +>] bz +c 


k En + 441011 + A12b21 + 12CC21 A11b12 + A1112 + A12b22 + -a 
anbi + 9111 + Ao2bo1 + a22C271 Aa by + A21C12 + A22b22 + A220272 

7 eee +ab2 anbi + Be. (eae! + €12Cr1  @41C12 + ae 
anbi + ay2b21 421b12 + a22b22 A21C11 +422C21 421C12 + A22C22 


= AB + AC, 


and similar computations show that (B + C)A = BA + CA. (Note that we must confirm A(B +C) = AB + AC 
and (B + C)A = BA + CA separately because matrix multiplication is not commutative.) 
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Important: Matrix multiplication is associative and distributive. So, for all 2 x 2 matrices 


Vv? A, B, and C, we have 


A(BC) = (AB)C, 
A(B+C) = AB+AC, 
(B+C)A = BA + CA. 


m nn NR ' 
(eeno Again, we can use the fact that a matrix represents a function to understand why 
| A matrix multiplication is associative. For any three functions f, g, and h, we have 


fogh) =f g) ok 


If matrices A, B, and C represent functions f, g, and h, respectively, then we have 


A(BC) = (AB)C. 


Problem 10.11: 
(a) IfA i) = ( 0 ), then must A = G Ak 


) and A =) = Get then must A = © a 


17 


| 


| 


-1 Bu Z 
= ? 
( 5 ), then must A f a ? 


Solution for Problem 10.11: Let A = & D so A 6) = ie F F 
Az, a2 2 Ay + 2422 


(a) We must have ay + 2a = 17 and az + 2a» = 0. We have four variables among these two linear equations. 
Choosing a12 = 7 and az = —1, and then solving for ay; and az, gives us A = < F . But there are other 
possibilities for A, since these aren’t the only possible values we can choose for 4,2 and az. The solution 


A= es A is one particularly obvious possibility for A. Choosing a12 = 1 and az. = 2, and then solving 


for a1; and az, gives us A = e A as another possibility. For any choices of 412 and 422, we can solve for 


A Pa 1 1 ee oe 
ayı and az; to produce a matrix that satisfies A (3) = ( R , SO there are infinitely many possibilities for A. 


(b) Now, we have four equations for our four variables. As in part (a), we have 
āu + 2412 = 17, á 


Ay, + 2aāņ = 0. 


From A & = F a) , We also have 


—2011 z 4ay> = —34, 
—2Ay, — 4ax = 0. 


Adding twice a1; + 2412 = 17 to —2a,, — 4a12 = —34 gives us 0 = 0. Uh-oh. The same happens when we 
add twice a1 + 2a22 = 0 to —2a71 — 4a22 = 0. The second system of equations above is just —2 times the first 
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system! Since the new pair of equations is just the same as the pair of equations we already had, all the 
solutions in part (a) are also solutions here. Once again, there is not just one possible matrix A. 


(c) Again, we must have ay; + 2412 = 17 and a21 + 2427 = 0. From A a = S we have 2411 — 412 = —1 and 


2a21 — An = 5. The equations 243; — A12 = —1 and ay, + 2a12 = 17 give us 411 = 3 and an = 7, and the other 


two equations give us A2 = 2 and aņ = —1. So, in this part, we must indeed have A = 4 ; 


What made part (c) different from the other two parts? It isn’t surprising that knowing what Av equals for 
one vector v isn’t enough to figure out what A is, but why can we figure out what A is in part (c), but not in part 


(b)? Looking more closely at (b), we see that the two vectors multiplied by A are 6 and = , which are not 


linearly independent. But the two vectors multiplied by A in part (c) are linearly independent. Maybe that’s the 
key? Let’s see. 


[Problem 10.12: Suppose v and w are linearly independent and A and B are matrices such that Av = Bv and 
Aw = Bw. 
(a) Why must Ai = Bi? 


| (b) Show that A = B. 


Solution for Problem 10.12: 

(a) In Problem 9.21, we showed that if v and w are linearly independent, then every point (x, y) is in the graph 
of the parametric equation a = pv + qw, where p and q are parameters. Therefore, for any vector u, there 
are scalars p and q such that pv + qw = u. Specifically, there are scalars r and s such that rv +sw = i. Adding 
r times Av = Bv to s times Aw = Bw gives rAv + sAw = rBv + sBw, which we can manipulate as follows: 

rAv + sAw = rBv + sBw 
= A(rv)+ A(sw) = B(rv) + B(sw) 
= A(rv + sw) = B(rv + sw). 
We also know that rv + sw = i, so we have Ai = Bi. 


(b) In Problem 10.2, we saw that the first column of A is Ai. Therefore, part (a) tells us that the first column of 
A is the same as the first column of B. By a similar argument to part (a), we can show that Aj = Bj, so the 
second column of A is the same as the second column of B. Since A and B have the same first column and 
the same second column, we have A = B. 


If v and ware linearly independent and A and B are matrices such that Av = Bv 
and Aw = Bw, then A = B. 


‘Important: 


10.2.1 Compute the following products: 


» & 4G 4 © Co o) 3) 
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ay 


0 : ) . Show that the product of two diagonal matrices is 
2 


10.2.2 A 2x2 matrix is diagonal if it is of the form ( 
also diagonal. 


10.2.3 Is there a matrix F such that for any matrix A, the matrix FA is the matrix formed by swapping the rows 
of A? If such a matrix exists, what is AF for that matrix? 


10.2.4 If AB = 0, then must either A or B be 0? 
4 -3 -3 0 4 -3 
a sX : > 
10.2.5 Let T (%) = (]°) ana (7°) = (3). whatis T ( %, A 
10.2.6 In Problem 10.11, we showed that there is more than one matrix A for which we have A (3) = E and 


A (T = i) Our first step in doing so was letting A = a a and building systems of equations by 


multiplying out the left sides of the given equations. Did we have to do that in order to see that the two given 
equations are really the same equation? 


10.3 Matrices as Transformations 


| _ Problems (dg 


Problem 10.13: a = 
(a) Find a matrix I such that multiplying any vector by I leaves the vector unchanged. In other words, find 
I such that Iv = v for all vectors v. Is this matrix unique? 
(b) Do we have IA = A for any matrix A? 


Problem 10.14: Find the matrix A such that Av= kv for all vectors v, where k is a scalar. 


Problem 10.15: In this problem, we show that there is a matrix N such that Nv is a 90° counterclockwise rotation 
of v for all v. 


a) Letv= * \. Find the image of the point (x, y) under a 90° counterclockwise rotation. Use your result to 
y 8 p l y 


find the desired matrix N. 


(b) Use part (a) to show that a 90° counterclockwise rotation is a linear transformation. In other words, show 
that the image of a 90° counterclockwise rotation of cv is c times the 90° counterclockwise rotation of v, 
and the image of a 90° counterclockwise rotation of v + w is the sum of the 90° counterclockwise rotations 

of v and w. 


Problem 10.16: In this problem, we find a matrix T such that the vector Tv is a counterclockwise rotation of the 
vector v by an angle ¢. 


(a) Letv= X). Suppose the point P is the image upon rotating the point (x, y) an angle @ counterclockwise 
y PP p ge up Beep Y 8 


about the origin. What are the coordinates of P? 
(b) Find the desired matrix T. Test your matrix by finding T for @ = 0° and ¢ = 90°. 
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Problem 10.17: A sequence (a1, b1), (a2, b2), (a3, b3), ... of points in the coordinate plane satisfies 


(4n41, bn41) = (VBa, =o (ry V3bn + Ay) 
for n = 1,2,3,.... Suppose that (a100, b100) = (2,4). What is a; + b1? (Source: AMC 12) 


Problem 10.18: Let v = G) and z be the complex number x + yi. 


Re(zi) 
Im(zi) 
C corresponds to multiplying z by i. 


(a) Find the matrix C such that Cv = ( ) . In other words, find the matrix C such that multiplying v by 


(b) Find the matrix D such that multiplying v by D corresponds to multiplying z by a + bi. (Your answer 
should be a matrix whose entries are in terms of a and b.) 


Problem 10.19: As a reminder, we say that a function f with domain R? is a linear function if 
(i) f(av) = af(v) for all scalars a and vectors v, and 
(ii) f(v + w) = f(v) + f(w) for all vectors v and w. 
Suppose g(w) is a linear function that maps vectors to vectors. 


(a) Explain why g(xi + yj) = xg(i) + yg()). 
(b) Why does part (a) tell us that there exists some matrix A such that g(w) = Aw for all w? 


Problem 10.13: 
(a) Find the matrix I such that multiplying any vector by I leaves the vector unchanged. In other words, 
-~ find I such that Iv = v for all vectors v. Is this matrix unique? 


(b) Do we have IA = AI = A for any matrix A? 


Solution for Problem 10.13: 


(a) From Problem 10.2, we know that the first column of I is Ii. Since Iv = v for all v, we have Ii = i, so the first 
column of I is i. Similarly, we have Ij = j, so the second column of I must be j. So, if there is a matrix I such 
that Iv = v, then it must be 


We aren't quite finished! We have only shown that if there is a matrix such that Iv = v, then it must be 
= G 4 We have not shown that this matrix satisfies Iv = v for all v; we’ve only shown that Iv = v 


when v is i or j. Fortunately, we can patch this hole very quickly: 


o DGG 


so G 1) v = v forall v. 
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Make sure you see why the second part of our solution is necessary. Our first 
step of considering Ii and Jj only tells us that what the columns of I must be 
if a matrix with the desired property exists. We need the last part of the solution 

to show that the matrix thus constructed does indeed satisfy Iv = v for all v. 


-WARNING!! 
ry 


(b) First, we show that IA = A. The first column of IA is the product of I and the first column of A, which is 
just the first column of A. Similarly, the second column of IA must be the second column of A, so IA = A. 
We also could have written out both matrices to confirm: 


e i ie a) " a A 
0 1/ an an Ax a22) 


For yet another solution, note that part (a) gives us I(Av) = Av for any matrix A and vector v. We also 
have I(Av) = (IA)v. Therefore, we must have (IA)v = Av for all vectors v, so IA = A. 


To show that AI = A, we can again write out both matrices: 
@ a 4 = te a 
49, a2 0 1 A21 A22 i 


We could also again use part (a). We have (AI)v = A(Iv) = Av for all vectors v, so we have AI = A. 


Important: The matrix 


Y 


is called the identity matrix. Multiplying a vector v by I leaves the vector 
unchanged; in other words, Iv = v for all vectors v. We also have IA = AI = A 
for any 2 X 2 matrix A. 


Problem 10.14: Find the matrix A such that Av = kv for all vectors V, where k is a scalar. 


Solution for Problem 10.14: Solution 1: Equate corresponding components. Performing the multiplications on both 


sides of 2 n (2 7 (2 


ae _ a 
ex+dy)/  \ky/ ` 


From ax + by = kx, we havea = k and b = 0, and from cx + dy = ky, we have c = 0 and d = k. Checking, we see that 
To 
0 k/\y/) \ky/) W 


Solution 2: Apply the result of Problem 10.2. We must have Av = kv for all v, so we must have Ai = ki and Aj = kj. 
Therefore, if there is a matrix A such that Av = kv for all v, then the first column of A must be ki and the second 


must be kj. We can quickly confirm that ¢ 4 (;) =k C) ,soA= È ; is the desired matrix. 0 


So far, we have discovered a matrix that does nothing (the identity) and a family of matrices that scale vectors. 
Now, we turn to matrices that rotate vectors. 


gives 


for all x and y, as desired. 
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Problem 10.15: In this problem, we show that there is a matrix N such that Nv is a 90° counterclockwise 
rotation of v for all v. 


(a) Letv= (Ih Find the image of the point (x, y) under a 90° counterclockwise rotation. Use your result 
to find the desired matrix N. 


(b) Use part (a) to show that a 90° counterclockwise rotation is a linear transformation. In other words, 
show that the image of a 90° counterclockwise rotation of cv is c times the 90° counterclockwise rotation 
of v, and the image of a 90° counterclockwise rotation of v + w is the sum of the 90° counterclockwise 
rotations of v and w. 


Solution for Problem 10.15: 


(a) What’s wrong with the following solution: 


A 90° counterclockwise rotation of i is j, and a 90° counterclockwise ro- 
tation of j is —i. The product Ni equals the first column of N, so the first 
column of N is j. Similarly, the second column of N is Nj, which equals. 


: M 
—i. Therefore, N = ( 0 i 


Bogus Solution: 


Bie 


This bogus solution does find the correct matrix. Moreover, the bogus solution shows that if the desired 
Mer 0 =i , ' f 
matrix exists, then C 0 ) must be that matrix. However, the solution does not show that Nv is a 90° 


counterclockwise rotation of v for all v. 


Our first step in testing that Nv is a 90° counterclockwise rotation of v for all v is to figure out, in terms 
of x and y, what vector results when v is rotated 90° counterclockwise. We let P be (x, y) and P’ be the image 
of P upon a 90° counterclockwise rotation. Rotation is often easier to analyze with polar coordinates. We 
let the polar coordinates of P be (r, 8), so the polar coordinates of P’ are (r, O + 90°). Converting these to 
rectangular coordinates, the rectangular coordinates of P are x = r cos @ and y = r sin 0, and the rectangular 
coordinates of P’ are 


x’ = rcos(@ + 90°) = r (cos 0 cos 90° — sin 0 sin 90°) = -r sin 0, 
y =rsin(@ + 90°) = r (sin 0 cos 90° + sin 90° cos 0) = r cos 0. 


Comparing our expressions for x’ and y’ to those for x and y, we find that x’ = -y and y’ = x, so the 
rectangular coordinates of P’ are (—y, x). 


Now we can check that the matrix we found in the Bogus Solution does indeed produce the desired 


1 0 y x i 


This matches our work above and thereby tells us that (| z v is indeed a 90° counterclockwise rotation 


of v for all v. 


(b) Weshowedearlier that the function represented by a matrix is a linear function. That is, we have N(cv) = cNv 
and N(v + w) = Nv + Nw for any scalar c and any vectors v and w. The fact that we can represent a 90° 
counterclockwise rotation of a vector as multiplication by a matrix N tells us that such a rotation is linear, 


too. O 


Important: Ifa function that maps a vector to another vector can be represented bya matrix, 
Y then the function is linear. 
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We'll now generalize Problem 10.15. 


Problem 10.16: Given an angle @, find a matrix T such that for any vector v, the vector Tv is a counterclockwise 


rotation of v by œ. 


Solution for Problem 10.16: We use our solution for a 90° rotation as our guide. We let P be (x, y) and P’ be the 
image of P upon a counterclockwise rotation by angle @. Rotation suggests polar coordinates, so we let (r, 0) be 
the polar coordinates of P, which means that x = rcos 0, y = r sin 0, and the polar coordinates of P’ are (r, 0 + @). 
If (x’, y’) are the rectangular coordinates of P’, then we must have 


x’ = rcos(0 + ġ) = rcos O0 cos ġ -rsin 0 sing = xcos¢ — ysin Q, 
y = rsin(0 + ġ) = rsin 0 coso +rcos Osing = y coso + xsin®g. 
We could now find the columns of T by finding Ti and Tj, and then confirm that the resulting matrix produces 


the desired expression for Tv for any v. Or, we could simply consider the product T G) and compare it directly 


to the desired result. Taking the latter approach, we seek the matrix such that 


( 4 (*) _ a 2 @ cos@ — ysin a 
c d) \y) \y’/ \xsing + ycoso/ ` 
Therefore, we must have ax + by = x cos ọ — ysin dg and cx + dy = xsin ọ + y cos d, which means our desired matrix 
is 
ie @ -sin 
sind coso /` 
O 


We can test that our matrix produces the correct result when } = 0° and ¢ = 90°. A rotation of 0° is the identity 
transformation—nothing is changed. Sure enough, setting @ = 0° gives us 


P 0° -sin a _ (; w 
sin0° cos0°/  \0 1” 
which is the identity matrix. 

When @ = 90°, we have 


e ma T) 
sin90° cos90°/ \1 07” 


which is the matrix we found in Problem 10.15 for a 90° counterclockwise rotation. 


a cass)? 


then Tv is a counterclockwise rotation of v by an angle of @. 


Problem 10.17: A sequence (a1, bi), (a, by), (a3 j b3), . _. of points in the coordinate plane satisfies 


(Ans1, Dnt) = ( V3an = Dyp V3b, ap An) 


for n = 1,2,3,.... Suppose that (a100, b100) = (2,4). What is a1 + b1? (Source: AMC 12) 
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Solution for Problem 10.17: The ordered pair ( VBan — bn, V3bn + an) is a linear combination of the components of 


v3a by 


the ordered pair (an, bn). This means that we can represent ( Vb, be ) as the product of a matrix and a 
n n n 


(ae z V3an — bn = v3 = A 
Dist V3b, + Ay, 1 v3 bn) 
The V3 entries make us think of rotation, but the entries clearly cannot all be cosines and sines because cosines 


and sines cannot be greater than 1. However, dividing all the entries in the matrix by 2 (factoring the scalar 2 out 
of the matrix) gives us some very familiar fractions: 


cal Ge ag o) 


Now, we're in familiar territory, and we see that 
Ge) > en 30° -sin A a 
bnii/ ~ \sin30° cos30° ba) ` 


cos30° —sin30°\ (a, 

Mhie product ~ 30° cos30° ) & 
_ /cos30° -sin30° _ (an 

T= Ghee cos 30° ) and v, = ioe we have 


V100 = 2T v99 = 2T(2T vos) = 29 T- T vı. 
99 T’s 


Specifically, we have 


An 


a . Letting 


) gives us a vector that is a 30° counterclockwise rotation of ( 


Just as we use an exponent to denote a repeated product of any number, we can write the product of 99 T’s as Ti 


so we have 
V100 = 2T? v. 


We know that Tv; is a 30° counterclockwise rotation of vi, So Tv, is a 12 - 30° = 360° rotation of vı. Therefore, 
we have Tv] = vı, which means that T!? =I. Since 99 = 3 + 12-8, we have 


8 
Vio = 297 y, = 299-73 T) ns 29Tv]. 


Since Tv; is a 30° counterclockwise rotation of v1, the product Tv; gives us a vector that is a 90° counterclockwise 
rotation of v;. So, we have 


_ „99 (608 90° Tene (a _ 499 G a Ca Lees ee 
u e Cae cos90° / \b; ma uy bı aie a, )’ 


which means Gal = E 


b i . We are given that a109 = 2 and bio9 = 4, so 
100 


p, = 210 awo _ 4-2_ 1 
a1 +0 = 99 — 599 ~ 999 ~ 998" 


Problem 10.18: Let v = C) Find the matrix D such that multiplying v by D corresponds to multiplying 


x + yi by a + bi. (Your answer should be a matrix whose entries are in terms of a and b.) 


a OEE O U O S a 
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Solution for Problem 10.18: We have 


(a + bi)(x + yi) = (ax — by) + (bx + ay)i. 


D a E n l 
y bx + ay 


From this, we can. easily read the components of D: 
i a (*) i a A 
l a y bx | ay l 


We've seen now that a great many different transformations of vectors can be represented by matrices. Now, 
we turn to our claim on page 338 that any linear function that maps vectors to vectors can be represented as 
multiplication by a matrix. 


Therefore, we must have 


Problem 10.19: | As a reminder, we say that a function f with domain IR? is a linear function if 


(i) f(av) = af(v) for all scalars a and vectors v, and 
(ii) f(v + w) = f(v) + f(w) for all vectors v and w. 
Suppose g(w) is a linear function that maps vectors to vectors. 


(a) Explain why g(xi + yj) = xg(i) + yg(j). 
(b) Why does part (a) tell us that there exists some matrix A such that g(w) = Aw for all w? 


Solution for Problem 10.19: 

(a) Applying item (ii) in our definition of a linear function, we have g(xi + yj) = g(xi) + g(yj). Then, we apply 
item (i) twice to find g(xi) + g(yj) = xg(i) + yg(j). Therefore, we have g(xi + yj) = xg(i) + yg(j). 

(b) We can write any vector v as 01+ vj for some scalars vı and v2. So, for any linear function g, applying part 
(a) gives us 

g(v) = gvi + vj) = vigli) + v2g(j). 

Since g maps vectors to vectors, there exist scalars 411, 421, 412, and a» such that g(i) = ani + anj and 
g(j) = ani + anj. Therefore, we have 


g(v) = vi(ani + a21)) + v2(a121 + a72)) = (V1441 + V2A12)i + (V1A21 + V2A2)j. 


If you don’t see Av on the right side in this notation, perhaps you will if we write the right side like this: 
van + Bia ee i (5) 
v)= = l 
st) ee + 02422 A21 A22) \U2 


Letting A = a A , we have shown that there is a matrix A such that g(v) = Av for all v. The columns 
21 Ar 


of A are g(i) and g(j), so A is uniquely determined by g. Therefore, we have proved the following: 


Important: | Any linear function that maps vectors to vectors can be expressed as multipli- 
W cation by a matrix, where the columns of the matrix are given by the vectors to 


which i and j are mapped by the function. 
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The final observation above gives us a very easy way to re-create the matrices we discovered in this chapter 
whenever we need them. For example, now that we know that rotation is a linear transformation, to find the matrix 
N that corresponds to a 90° counterclockwise rotation, we only need to find Ni and Nj. A 90° counterclockwise 
rotation of i is j, and a 90° counterclockwise rotation of j is —i. Therefore, the first column of N is j and the second 
one O =! ae ; , 
is —i, which means N = ç 0 i Similarly, we can determine the effect of any linear transformation on any 


vector by examining the effect of the transformation on the easy-to-handle vectors i and j. 


10.3.1 Suppose that A (D = e Must A be the identity matrix? If not, find a possible matrix A besides I. 


10.3.2 Suppose a function g maps the vector v to the reflection of v over the x-axis. Is there a matrix A such that 


g(v) = Av for all vectors v? If so, find it. If not, explain why there is not. 


2 
10.3.3 Suppose a function f maps the vector C) to the vector (7a) . Is there a matrix A such that f(v) = Av for 


all vectors v? If so, find it. If not, explain why there is not. 


10.3.4 Show how to use the rotation matrix we discovered in Problem 10.16 to recall the angle sum identities for 
sine and cosine. 


10.3.5 Suppose a point P is reflected over the x-axis and the image is rotated 90° counterclockwise about the 
origin. Is the result the same as if we first rotate P by 90° counterclockwise about the origin, and then reflect the 
result over the x-axis? 


10.4 The Determinant 


We introduced 2 x 2 matrices with a system of two linear equations with two variables. We start this section by 
considering when it is possible to find a unique solution to such a system. 


Problem 10.20: Leta, b,c, d, u, and v be constants in the system of equations 


ax + by =u, 
cx + dy =v. 


(a) Show that if ad — bc + 0, then the system has a unique solution (x, y). 


(b) Show that if ad — be = 0, then the system either has no solutions (x, y) or has infinitely many solutions. 


Solution for Problem 10.20: 
(a) We eliminate y by multiplying the first equation by d and the second by b to get 


adx + bdy = du, 
bcx + bdy = bv, 


and then subtracting the second equation from the first to find 


adx — bcx = du — bv. 
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(b) 


Factoring the left gives (ad — bc)x = du — bu. Since ad — bc + 0, we can solve for x by dividing both sides to 
find 
_ du — bv 
= d- bc 


Similarly, we find y by subtracting c times ax + by = u from a times cx + dy = v to get 
(ad — be)y = av — cu. 
Once again, we need to have ad — bc + 0 in order to divide both sides by ad — bc to produce 


_av-cu 
y= ad — bc 


Therefore, if ad — bc # 0, the system has the unique solution (x, y) = (2 a L 


Before tackling the proof that there are either no solutions or infinitely many solutions, we'll get some 
geometric intuition for what’s going on. The graphs of the two equations in the system are lines with 


ar =p -d ; : E 
direction vectors ( P ) and ( i i respectively. These lines either don’t intersect (and the system has no 


solutions) or are the same line (and the system has infinitely many solutions) if and only if (T Zk w 


for some scalar k, or if c = d = 0. In either case, we have ad — bc = 0, since b = kd and a = kc gives 
ad — bc = kcd — kcd = 0, and c = d = 0 gives ad — bc = 0. 


Now, let’s take a more rigorous approach. In part (a), we showed that the system of equations implies 
the equations 


(ad — bc)x = du — bv, 
(ad — bc)y = av — cu. 


If ad — bc = 0, then these equations become 


0 = du — bv, 

0 = av — cu. 
So, if du — bv + 0 or av — cu + 0, then the corresponding equation above cannot be true, which means the 
system has no solutions. If du — bv = 0 and av — cu = 0, then we have du = bv and av = cu. To see what 


this means for our original system of equations, ax + by = u, cx + dy = v, we multiply the first by v and the 
second by u, to get 


avx + buy = uv, 
cux + duy = uv. 


If du = bv and av = cu, then these two equations are the same, so any of the infinitely many solutions (x, y) 
to the first equation are also solutions to the second. If u and v are nonzero, then dividing the first equation 
above by v and the second by u shows that these infinitely many values of (x, y) are solutions to the original 
system. With some careful casework, we can show that even if u or v is 0, then these infinitely many values 
of (x, y) are still solutions to the original system. 


Therefore, if ad — bc = 0, the system 


ax + by =u, 
cx +dy =v, 


either has no solutions or it has infinitely many solutions. 0 
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We can write the original system of equations in Problem 10.20 as 
(Ca) =) 
c d) i) ao 
Our work in the solution shows that this system has a unique solution G) if and only if ad — bc + 0. This is such 


an important property that for any matrix ( s) , we have a special name for the quantity ad — bc. 


Definition: The determinant of the matrix : a is the quantity ad — be. We often denote this determinant 


a b 
A = ad — be. 


We also can denote the determinant of a matrix A as det(A). 


In Problem 10.20, we proved that if ad — bc + 0, then the system of equations 


ax + by =u, 
cx + dy =v, 
has the unique solution (x, y) = (= — z, ~ - =e) . We can write this solution in terms of determinants: 
u b a u 
v d c v 
X= ; = e 
a b y a b 
c d c d 


If we express the system of equations as M ie) = AT where M = é A) we see that each denominator in 
the solution to the system is det(M). The numerator of x is the determinant of the matrix formed by replacing 
the first column of M (which corresponds to the coefficients of x in the system of equations) with the column 
c . Similarly, we find the numerator of y by replacing the second column of M with c and then taking the 


determinant of the resulting matrix. This method for solving a system of two linear equations with two variables 
is referred to as Cramer’s rule. 


For the rest of this section, we will explore properties of the determinant. 


| Problems ig 


Problem 10.21: Solve each of the following systems of equations with Cramer’s rule: 


(a) 4x — 14y = 7, (b) 9a = 5b +7, 
—19x + 88y = —1. —16b +294 = —3. 


Problem 10.22: gae 
(a) Evaluate 47 


-2+4 3+7 —24+3-4 3+3-7|° 
4 7 4 TAE 


(b) Make a conjecture based on your results from part (a), and then prove your conjecture. 


4 
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Problem 10.23: 
(a) Suppose B = 2A. How are det(A) and det(B) related? 


(b) Let C be the matrix formed by doubling the entries in the first row of A. How are det(A) and det(C) 
related? 


157 199 
Problem 10.24: Evaluate 314 399 


jana |_ 33 Bh 


1200 3600) 


Problem 10.25: 


4 5 -3 7 
(a) Evaluate 3 7 and 4 A 


(b) Make a conjecture based on your results from part (a), and then prove your conjecture. 


Problem 10.26: 
(a) Find three matrices with nonzero components such that each has determinant 0. 


(b) Notice anything interesting about your matrices in part (a)? Specifically, how are the rows related to each 
other in a matrix with determinant 0? 


(c) How is your answer to part (b) related to what happens in a system of equations in which we cannot 
apply Cramer’s rule because the denominators would come out to be 0? 


Problem 10.27: Let A = ie i ) and B = ie Sere 


Qe Ss 6 4 
(a) Find det(A), det(B), det(AB), and det(A + B). 


(b) Make a conjecture based on your results from part (a), and then prove your conjecture. 


We start with a little practice computing determinants and applying Cramer’s rule. 


Problem 10.21: Solve each of the following systems of equations with Cramer’s rule: 


| @) 4x- 14y = 7, (b) 9a = 5b +7, 
| -19x + 88y = -1. -16b + 29a = -3. 


Solution for Problem 10.21: 


(a) Applying Cramer’s rule gives 


K a 
-1 88| _ (7)(88)—(-1)(-14) _ 602 


on | 4 a ~ (4)(88) —(-19)(-14) 86’ 
To a 
4 7 
E 4 aaao a, 
ae ae 86 > 36 oh 
-19 88 


(b) What’s wrong with the following start: 
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Bogus Solution: Cramer’s rule gives us 


Za B4 
_ |-3 29 -16 -3 
email ih k | 

—16 29 —16 29 


There are all sorts of things wrong with this start, and they all stem from blindly sticking the coefficients 
and constants into Cramer’s rule without first organizing the data in the problem in a way that makes 
Cramer’s rule easy to apply. In our expression of Cramer’s rule, we defined the coefficients such that all 
the variables are on the left sides of the equations and the constants are on the right sides. We also avoid 
mistakes by writing the variables in alphabetical order. Reorganizing the original system of equations in 


these ways gives us 
9a = 5b =7, 
29a — 16b = —3. 


Now, we apply Cramer’s rule to find 


7 5 
ag Fl _ (7-16) = CC) _ 112-15 _ on 


oF [ a = O16) (29-5) -144+ 145 — 


29 —16 
9 7 
_ {29 -3| _  (9)(-3) - (29)(7) a =e! 200 
~ 19 -5| (9)(-16)— (29)(-5) — 1 i 
29 -16 


B 
Now that we have a little practice in computing determinants, let’s explore some properties of determinants 
that can make evaluating them easier. 


—— 
| Problem 10.22: 


| (a) Evaluate aooe Ep rie padi GLI 


4 7\| 4 5 "|, and | 4 7 


(b) Make a conjecture based on your results from part (a), and then prove your conjecture. 


l 


Solution for Problem 10.22: 


(a) We have 
-2 3 
| 4 l = (-2)(7) - (4)(3) = -26, 
ie ik : | = (—2 + 4)(7) — (4) + 7) = —2(7) + 4(7) — 4(3) — 4(7) = -26, 


The latter two matrices are formed by adding a constant times the second row to the first row. Our 
results suggest that this procedure will always leave the determinant unchanged. Our writing out the 
computations of the latter two determinants in great detail gives us a pretty good clue why this will always 

happen. 
ne 
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(b) Now that we know what to look for, it’s easy to find. We wish to show that adding a multiple of one row 
to the other will not alter the determinant of the matrix. We have 


ae PA = (a+ ke0) = (06+ hd) = ad + ked = be ~ ked = ad = be = | E|, and 
i LG Neen a a a fl | 
c+ka da kb) = MG + kb) - (b)(c + ka) = ad + kab — be — kab = ad -bc = |e gj: 


Therefore, adding a multiple of one row of a 2 x 2 matrix to the other row of the matrix does not change the 
determinant of the matrix. 


As an Exercise, you'll show that adding a multiple of one column of a 2 x 2 matrix to the other column of the 
matrix leaves the determinant of the matrix unchanged. 


b | latkb b 
~ lce+kd d 


a b+ka 
c d+kc 


a — 


PE DE 


Important: a b 
c d | 


iE d E 


Problem 10.23: 
(a) Suppose B = kA. How are det(A) and det(B) related? 


(b) Let C be the matrix formed by multiplying the entries in the first row of A by k. How are det(A) and 
det(C) related? i 


Solution for Problem 10.23: Let A = i: a , 80 det(A) = ad — bc. 


(a) We have 
ka kb 


det(B) = det(kA) =|, gg 


| = (ka)(kd) — (kb)(kc) = k’ (ad — bc) = k’ det(A). 


Important: aeHon any 2 x 2 matrix A and any constant k, we have det(kA) = det(A). ; 


(b) We have 
ka kb 


det(C) = z 


| = (ka)(d) — (kb)(c) = k(ad — be) = k det(A). 


Similarly, multiplying the second row of A by any constant k gives a matrix with determinant k det(A), and 
multiplying either column of A by any constant k gives a matrix with determinant k det(A). 


Important: — Multiplying either column or either row of a matrix A by any constant k gives 


a matrix with determinant k det(A). 


O 


We can sometimes use the rules that we proved in the previous two problems to simplify the computation of 
the determinant of a matrix. 


Problem 10.24: Evaluate and 


157 199 33 =77 
314 399 —1200 3600)’ 
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Solution for Problem 10.24: For the first matrix, we notice that 314 is double 157, so we add —2 times the first row 
to the second row, and we have 


157 199 


157 199 157 199 
314 399| > 


Z e -2(157) 399 — 2(199)| ~ | 0 1 


| = (157)(1) — (0)(199) = 157. 


Turning to the second matrix, we see a common factor of 11 in the first row and 1200 in the second. Repeatedly 
applying the principle we proved in part (b) of Problem 10.23, we have 


roe me = Til bei a = (11)(1200) E l = (11)(1200)((3)(3) — (-7)(-1)) = 26400. 
oO 
Problem 10.25: 
(a) Evaluate is : and i il 


(b) Make a conjecture based on your results from part (a), and then prove your conjecture. 


Solution for Problem 10.25: 


É ; = (4)(7) — (-3)(5) = 28 + 15 = 43 and ae 4 


(b) It looks like swapping the rows of a 2x2 matrix A produces a matrix with determinant (-1) det(A). Proving 
this is true is straightforward: 


(a) Wehave 


| = (-3)(5) — (4)(7) = -15 - 28 = -43, 


c d 


| = cb —ad = ~(ad~ be) = | 


Important: For any 2x2 matrix A, swapping the rows or columns of A produces a matrix 
W with determinant (—1) det(A). : 


Problem 10.26: 
(a) Find three matrices with nonzero components such that each has determinant 0. 


(b) Notice anything interesting about your matrices in part (a)? Specifically, how are the rows related to 
each other in a matrix with determinant 0? 


(c) How is your answer to part (b) related to what happens in a system of equations in which we cannot 
apply Cramer’s rule because the denominators would come out to be 0? 


Solution for Problem 10.26: 


(a) With a little trial and error, we can find many matrices with determinant 0. Here are some: 
io ijl e 3a- le =s] || =_= 6 | _o 
Satie |2 3; J6 -10 |1 4i |25 -30 ` 


(b) In each of the example matrices in part (a), the first row is a multiple of the second. To see why this is 


= ad — bc. This equals 0 if and only if ad = bc, which we can write as £ = § 


a 
happening, we note that o 
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when cd + 0. When Ẹ = 2, the first row of the matrix é 5} is a multiple of the second. If c = 0, then we 


can only have ad — bc = 0 if a = 0 ord = 0. In either case, the second row is a multiple of the first. (Similarly, 
if d = 0, we have bc = 0, from which we find that the second row is a multiple of the first.) 


Therefore, the determinant of a 2 x 2 matrix is 0 if and only if one row is a multiple of the other. 
(c) We can only apply Cramer’s rule to the system 
ax + by =u, 
cx + dy =0, 
if ad — bc # 0. What goes wrong when ad — be = 0? As we saw in part (b), we have ad — be = 0 if and only 
if one row of the matrix a 1) is a multiple of the other. This corresponds to the left side of one of the 


equations in our system being a constant multiple of the other left side. Geometrically speaking, the graphs 
of the two equations are either the same line or parallel lines. So, we cannot have a unique solution to the 
system of equations. 


o 


Just as we can describe a pair of vectors as being linearly dependent if one is a constant times the other, we can 
describe two rows of a matrix as being linearly dependent if either is a constant times the other. 


Important: The determinant of a 2 x 2 matrix is 0 if and only if the rows of the matrix are 
vv linearly dependent. 


We can equivalently state this fact as, “The determinant of a 2 x 2 matrix is nonzero if and only if the rows of 
the matrix are linearly independent.” 


Matrices with determinant 0 are so important that we have a special name for them. A matrix with determinant 
0 is called a singular matrix, and a matrix with nonzero determinant is called a nonsingular matrix. 


| 
| -2 -5 6 4 
| (a) Find det(A), det(B), det(AB), and det(A + B). 


(b) Make a conjecture based on your results from part (a), and then prove your conjecture. 


Problem 10.27: Let A = ( l a and B = é y 


Solution for Problem 10.27: 
(a) We have det(A) = (1)(-5) — (7)(-2) = -5 + 14 = 9, det(B) = (2)(4) — (-7)(6) = 8 + 42 = 50, det(A + B) = 


0 
4 4 = —3, and 


44 7 
at 


damo] 22 


Diy =e | = 6((44)(-1) — (7)(—17)) = 6(75) = 450. 


ag À 


=2.3 | 
(b) In part (a), we don’t see any obvious relationship between det(A + B) and det(A) + det(B), but we do have 
det(AB) = det(A) det(B). Let’s see if this is always the case. We have 


bu biz 
by bz 


ail 12 


= (411422 — 421412)(b11b22 — by b12) 
a21 422 


= 441822011 b22 — aaz babiz — az1412b11b22 + A21412b21b12 
= A11422b11b22 + 412421b12b21 — 411422b12b21 — A124721 b11b22, 
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where we have reorganized the terms in the last row to make it easier to compare to det(AB). Speaking of 
which, we have 


a 
da (( 11 a te aay É 
Ax 922) \ba bz 
= (abn + 412b21)(421b12 + a22b22) — (ar bi2 + 412b22)(a21 b11 + 422621) 
= 441b11021 by 2 + 44101, a22b22 + a12b21421 b12 + a12b21422b72 


ayb +4y2b2, ay by2 + a12b22 
azb + 22b71 aznb12 + a22b22 


— 4101 2021D1, — 411b12422b21 — 412b22421 b11 — a42b22A2b2}. 


We can clean this up a bit by organizing the terms in each product. When we do so, we see that we have 
some convenient cancellation: 


aii 412 bu biz 
det (( )( = 44102101, b10 + 411422b11 b22 + 412021 b10b21 + ay 2A2bnb 
aian) ba boo 11421011012 + 411422011022 + 412421012021 + 412422021 022 


— ay1421b11b12 — 411422b12b21 — 412421 b11 b22 — A12422b21 b22 
= q11422b11 b22 + 412421b12b21 — a11422b12b21 — A124721 b11 b22, 


which does indeed match our expression above for det(A) det(B). 


| Important: We have det(AB) = det(A) det(B) for any 2 x 2 matrices A and B. 


WARNING!! The values of det(A) + det(B) and det(A + B) are not necessarily equal. 


a + 
10.4.1 Use Cramer’s rule to solve the system 3x + 4y = —3x + 7, 5y = -2x + 9. 


10.4.2 Evaluate the following determinants: 


-3 -5 
ey | 6 a 


2400 3200 
©) Boe va 


10.4.3 Is it true that det(AB) = det(BA)? 


10.4.4 The first column of B is the same as the first column of A. The second column of B is the sum of the second 
column of A and k times the first column of A. Show that det(A) = det(B). 


10.4.5 We showed in the text that det(A) + det(B) does not necessarily equal det(A + B). 


(a) What if det(A) = det(B) = 0? Then, must we have det(A) + det(B) = det(A + B)? 
(b) What if we have det(A) = 0 but det(B) # 0. Then, must we have det(A) + det(B) = det(A + B)? 
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10.5 Geometric Interpretation of the Determinant 


Problem 10.28: In this problem, we investigate the geometric significance of the determinant of a 2 x 2 matrix. 


Let M = (? ‘) , let O be the origin, and let V and W be the points such that V = Mi and W = Mj. 


(a) Show that the area of the parallelogram with OV and OW as sides equals || ||| W/| sin 0, where 8 is the 
angle between V and Ñ, 


(b) Express the area in terms of We W, | VIL, and [WI]. 


(c) Find the area in terms of a, b, c, and d. 


Problem 10.29: 
(a) What is the geometric significance of a 0 determinant? 


(b) Find several matrices with a negative determinant, and draw the corresponding parallelograms. Make a 
conjecture about the geometric significance of a negative determinant. 


Problem 10.30: Use the geometric interpretation of the determinant to explain why det(kA) =K det(A) for any 
2x2 matrix A. f 


Problem 10.31: Use the geometric interpretation of the determinant to explain why adding a multiple of either 
column of a 2 x 2 matrix to the other column leaves the determinant of the matrix unchanged. 


Problem 10.32: In this problem, we find a method for finding the area of a triangle given the coordinates of its 
vertices. 

(a) Find the area of a triangle with vertices (0, 0), (5,4), and (1, —3). 

(b) Find the area of a triangle with vertices (4, 5), (7, 10), and (—3,7). 


(c) Find a formula for the area of a triangle with vertices (x1, y1), (x2, y2), and (x3, y3), where these vertices 
are in counterclockwise order. 


OV and OW as sides is the absolute value of det(M), where M = ( 2) 


Solution for Problem 10.28: First, we find the fourth vertex of the parallelogram by 
noting that if P is the point such that V + W = P, then OVPW isa parallelogram. 
Since AOVW = APWV, we have [OVPW] = 2[0VW] = 2(3(OV)(OW)sin8), 
where 0 = ZVOW. Therefore, we have [OVPW] = |v IWI sin 0, where @ is the 
angle between V and W. The dot product gives us another relationship involving 
V, W, and 0: 

PÑ = [PIIRI] cos 0. 
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We can now eliminate sin 0 from our expression for [OVPW]. Since 0° < 0 < 180°, we have sin@ > 0, so we 
take the positive root from sin 0 = V1 — cos? 0, and we have 


[OVPW] = ||V||||W]| sine 
= || VII WI] Vi - cos? 8 
Be (V- Wy 
SAN ey b= ee 
APAI? 
=v VP wr - 0 wy 
= y/ (a2 + 2)(2 + d2) — (ab + cd}? 
= Va2b? + ad? + bec? + c2d2 — a2b? — 2abcd — c2d? 
= Vad? — 2abcd + bc? = 4/ (ad — bc)? = jad — bel, 


as desired. O 


Important: The absolute value of the determinant l A equals the area of the parallelo- 


gram that has as two sides the vectors E) and H drawn from the origin. 


Repeatedly referring to the “absolute value” of det(M) is pretty unwieldy. Let’s take a closer look at what a 
negative determinant corresponds to geometrically. 


| Problem 10.29: 
| (a) What is the geometric significance of a 0 determinant? 


| (b) Find several matrices with a negative determinant, and draw the corresponding parallelograms. Make § 
| a conjecture about the geometric significance of a negative determinant. 


(c) 


Prove your conjecture in part (b). 


Solution for Problem 10.29: 


(a) Ifv and w are nonzero, then the parallelogram with sides v and w drawn from the origin has area 0 if 
and only if v and w are in the same or opposite directions. This occurs if and only if v is a multiple of w. 


Wj 


7 U0 : 2 z K 
This corresponds to the fact that the determinant g w, | 8 Zero if and only if the columns are linearly 
2 w2 


dependent. 
(b) To learn the significance of a negative determinant, we examine a few cases. Below are the parallelograms 
À 4j -3 4 =} =2 
whose areas correspond to the determinants 1 blz aV an 3 o | 
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(c) 


In each case, the determinant is negative. In each case, the smallest rotation from the vector given by the 
first column to the vector given by the second column is a clockwise rotation, as indicated by the arrows in 
the diagrams. 


Swapping the columns of a 2 x 2 matrix produces a matrix whose determinant is the opposite of the 


2| |4 -3 
determinant of the original matrix. Doing so for our three example matrices gives us i ibla 2} 
and Pe z , and the corresponding parallelograms are below. 


These are the same parallelograms as before, since the columns of the matrices are the same as before, 
only in opposite order. However, now the smallest rotation from the direction of the first column to 
the direction of the second is counterclockwise. We conjecture that the determinant of a 2 x 2 matrix is 
positive if and only if the smallest rotation from the vector of the first column to the vector of the second is 
counterclockwise. 

A (If v or w is 0, then det(A) = 0.) We 
need to relate det(A) to the angle between v and w, so we let 0 be the smallest rotation that must be applied 
to v to make it have the same direction as w. (As usual, if 0 > 0, this rotation is counterclockwise, and if 
cos -sin0 
sin cos0 


Let the columns of A be nonzero vectors v and w, so A = ( 


0 < 0, then this rotation is clockwise.) Therefore, the vector ( ) v has the same direction as w, 


so there is some positive constant k such that 
= i — sin a BEN S se = = ie cos 0 — kv sin A) 
~"“\sin@ cos@ ~“\sin@ cos v)  \ku sin + kv2 cos 0/ ` 
(Specifically, k = ||wl} / |IvIl.) 


Therefore, we have 


vı kvucos® — kv sin 0 


a= V2 kv, sin@ + kv2 cos 0 


= vı(kvı sin 0 + kv2 cos 0) — v2(kv; cos 0 — kv2 sin 0) 


= k(v? + v3) sin 0. 


Aha! Now we have the whole story. Since k, vj, and v3 are all nonnegative, the sign of det(A) must match 
the sign of sin 0. We have sin 0 > 0 for 0 < 0 < n, and sin@ < 0 for -r < 0 < 0. Since positive values of 0 
correspond to counterclockwise rotations and negative values of 0 correspond to clockwise rotations, we 
see that the determinant of a matrix is negative if the smallest rotation from the direction of the first column 
to the direction of the second column is a clockwise rotation, and positive if the smallest such rotation is a 
counterclockwise rotation. If the two columns have the same or opposite directions, then the columns are 
linearly dependent and the determinant is 0. 
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We can get rid of the “absolute value” in the statement, “The absolute value of the determinant i equals 


d 


the area of the parallelogram that has as two sides the vectors (*) and m drawn from the origin” by using the 


concept of signed area. We define the signed area of a polygon by first assigning an orientation, clockwise or 
counterclockwise, to the polygon. If the orientation is counterclockwise, as shown in the diagram at left below, 
then we say the signed area is positive. If the orientation is clockwise, then we say the signed area is negative. 


Yy Y 
(b, d) (a,c) 
(a,c) (b, d) 
X x 


Figure 10.1: A Parallelogram with Positive Area Figure 10.2: A Parallelogram with Negative Area 


With this definition in hand, we can refine the relationship between the determinant of a matrix and area. 


The value of the determinant : A equals the signed area of the parallelogram | 


that has as two sides the vectors | and (5) drawn from the origin, where | 


the orientation of the parallelogram is determined by the path around the 
parallelogram with first step from the origin to (a, c). : 


We learned in Section 10.3 that multiplying a vector by a matrix corresponds to a transformation of the vector. 
Our work so far in this section tells us about the effect of such a transformation on area. 


Specifically, we showed that if M = ie A and we let v = Mi = a and w = Mj = (4) , then the signed area 


of the parallelogram with sides v and w is det(M). Since the signed area of the parallelogram with sides i and j is 
simply 1, we see that multiplication by M scales the area of this parallelogram by det(M). 


We can extend this result to any two linearly independent vectors, not just i and j. If r and s are linearly 
independent, then the signed area of the parallelogram with sides Mr and Ms is det(M) times the signed area of 
the parallelogram with sides r and s. You'll be asked to prove this claim as a Challenge Problem. 


| Problem 10.30: Use the geometric interpretation of the determinant to explain why det(kA) = det(A) for 


any 2 x 2 matrix A. 


Solution for Problem 10.30: Let P be the parallelogram with sides v and w drawn Y 

from the origin, and let Q be the parallelogram with sides kv and kw drawn 

from the origin. Q is formed by scaling the sides of P by a factor of k. Therefore, 

Q and P are similar, and the ratio of the lengths of corresponding sides is k, 

which means the ratio of their areas is k’. y 


In our discussion above, we've glossed over the issue of negative determi- ky A 


nants and what happens when k < 0. Our discussion in Problem 10.29 takes 
care of both these concerns. The smallest rotation from the direction of v to the 
direction of w is the same as the smallest rotation from the direction of kv to 
the direction of kw (no matter what the sign of k is). Therefore, the determinants of A and kA must have the same 


sign. O 


kw 
xX 


367 


CHAPTER 10. MATRICES IN TWO DIMENSIONS 


Problem 10.31: Use the geometric interpretation of the determinant to explain why adding a multiple off 
either column of a 2 x 2 matrix to the other column leaves the determinant of the matrix unchanged. 


Solution for Problem 10.31: To add kw to v geometrically, we first draw v from 
the origin to V, and then draw kw starting from V. Because kw is parallel to 
w, the head of v + kw, point T in the diagram, is on the line through V parallel 
to w. Therefore, the distances from V and T to Ow are equal, which means 
that the areas of OWV and OWT are the same. This means that the area of the 
parallelogram with sides v and w (parallelogram OVPW) equals the area of 
the parallelogram with sides v + kw and w (parallelogram OTUW). 


Letting v = o and w = o this gives us 
U2 W2 


V,+kw, wy 
Vg +kw2 w2 


Oi Wi 
U: W2 


We haven't yet explained why the signs of these determinants are the same. When determining the orientation 
of the parallelogram corresponding to each of the determinants above, the last step in the path around the 
parallelogram in each is from W to O. So, the two parallelograms have the same orientation if these last steps 
complete circuits about the parallelograms with the same orientation. This is the case if and only if V and T are 
not on opposite sides of OW. Because VT || OW, we know that V and T (the heads of v and v + kw) cannot be 
on opposite sides of OW. Therefore, the orientations of the parallelograms corresponding to the determinants are 
the same, so the signs of the determinants are the same. (In the case shown, both are clockwise, and hence have 
negative area.) 


Similarly, we can add any multiple of the first column to the second without changing the determinant. 0 


Problem 10.32: In this problem, we find a method for finding the area of a triangle given the coordinates of 
its vertices. 
(a) Find the area of a triangle with vertices O = (0,0), A = (5,4), and B = (1, -3). 
(b) Find the area of a triangle with vertices P = (4,5), Q = (7,10), and R = (—3,7). 
(c) Find a formula for the area of a triangle with vertices (x1, y1), (x2, y2), and (x3, y3), where these vertices § 
are in counterclockwise order. _ 


Solution for Problem 10.32: 


(a) We know how to handle a parallelogram, so we “complete” the parallelogram by 
finding the point C such that C = A+B. As shown, this point is (5 + 1,4 + (—3)) = (6,1). 
Since AOAB = ACBA, we have 


1 


[OAB] = ZTOACB] = ; det C =| = TE o- = 


(b) From part (a) we know how to find the area of a triangle with one vertex at the origin, so we translate the 
triangle so that P is at the origin. The translation that maps (4, 5) to (0,0) also maps (7,10) to (7 — 4,10 —5) = 
(3,5) and maps (3,7) to (-3 — 4,7 — 5) = (-7,2). Therefore, we now seek the area of the triangle with 
vertices (0,0), (3,5) and (—7,2). By the same argument as in part (a), the area of this triangle is 
3 -7 1 4 

det(3 7) = 51@@ -(-NOI= > 


` 


1 
z 


L 
2 
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(c) Part (b) gives us a guide; we start by translating one vertex to the origin. The same translation that maps 

(x1, y1) to (0,0) also maps (x2, y2) to (x2 — x1, y2 — y1) and maps (x3, y3) to (x3 — x1, y3 — y1). Since the vertices 
y2-y pP y Y3- y 

(x1, y1), (X2, y2), and (x3, y3) are in counterclockwise order, so are the vertices (0,0), (x2 — x1, Y2 — yi), and 

(x3 — x1, Y3 — y1). The area of this triangle is half the area of the parallelogram with ee i” a and & E a 


Y2 — Y1 Y3 — Yı 
as sides, so the desired area is 
LE: Ce = aa l 
2 y2- Y1 Y3- Yı 


Note that if the vertices were in clockwise order rather than counterclockwise, then the expression above is 
negative, and gives the signed area of the triangle. 


O 


I txercises 


10.5.1 Find the determinant of the rotation matrix E 7 =en 


dng. oD i: Explain the result geometrically. 
10.5.2 Find the area of the quadrilateral with vertices (—2,3), (1, —5), (7, —4), and (8, 1). 


10.5.3 Explain geometrically why multiplying one column of A by a constant k produces a matrix with determi- 
nant k det(A). 


10.5.4 A lattice point is a point in the Cartesian plane with integer coordinates. Suppose 7 is a triangle in the 
Cartesian plane such that all three vertices of J are lattice points. Show that doubling the area of 7 must produce 
an integer. 


10.5.5x In Exercise 10.1.3, we discovered a matrix F such that multiplying a matrix on the right by F swaps the 
columns of the matrix. That is, the first column of AF is the second column of A, and the second column of AF is 
the first column of A. Explain geometrically what the determinant of F must be. 


10.6 Inverting a Matrix 


We have seen that we can solve the system of equations described by 


=) G) = Gio) 
2i iy 0 
with Cramer’s rule. Rather than grinding through Cramer’s rule, it sure would be nice to be able to divide by the 


matrix on the left. Unfortunately, division by a matrix is not defined. However, we can take another look solving 
equations with division by real numbers to get an idea how we might accomplish our goal. Consider the equation 


oie = 9). 


To solve this equation we can “divide both sides by 3.” What we’re really doing when we “divide both sides by 
3” is “multiplying both sides by 5." More precisely, we are multiplying by the multiplicative inverse of 3, which 
eliminates the 3 on the left side. Multiplying both sides of 3x = 9 by $ gives 


The 4 and 3 cancel, and we're left with x = 1 -9 = 


EDS TT 
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Let’s see if we can accomplish something similar with matrices. 


i Problems _ 3 


Problem 10.33: Suppose that ABv = v for all vectors v. What matrix must AB equal? 


Problem 10.34: If AB equals the answer you found in Problem 10.33, then we say that A is the inverse of B and 


that B is invertible. We often denote the inverse of a matrix B as B™!. Let M = e R Find M7!. 


3 


Problem 10.35: Does the matrix M = ine 10 


) have an inverse? If it does, then find it. If it doesn’t, explain 


why not. 


Problem 10.36: If A is invertible, then how are det(A) and det(A~') related? 


a b 
c d 
find the inverse in terms of a, b, c, and d. 


Problem 10.37: Let A = ( ). In this problem, we find the conditions under which A is invertible, and we 


(a) Suppose A is invertible, and let A™! = c . What matrix must A™A equal? 

(b) Use your answer to (a) to find a system of equations in terms of p, 4, a, b, c, and d. Solve the system for p 
and q in terms of the components of A. 

(c) Find r ands in terms of the components of A. 


(d) Under what conditions is it possible to find an inverse of A, and what is that inverse in terms of a, b, c, 
and d? 


(e) Confirm your result from part (d) by multiplying A by the matrix you found in part (d). 


Problem 10.38: 
(a) Find the inverse of ( ape 


(b) Use your answer to part (a) to solve the system of equations 4x — 3y = —2, 5x + 7y = —3. 


: A = 5 
(c) Find the inverse of wi 4 i 


Use your answer to part (c) to solve the system of equations —0.1t + 5u = —3, —0.4t + 30u = 7. 


) ana 8 = ( 


Problem 10.40: Earlier we saw that AB is not necessarily equal to BA. Must AA™ equal A7!A? 


4 
2 


= 6) 


Problem 10.39: If A = ( 1 


F: >) , then find all matrices M such that AM = B. 


Problem 10.41: Suppose A and B are invertible. Prove that (AB)! = B7!A7'. 


Problem 10.33: Suppose the product ABv = v for all vectors v. What matrix must AB equal? 
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Solution for Problem 10.33: In Problem 10.13, we discovered the identity matrix I, which is the unique matrix such 
that Iv = v for all vectors v. If ABv = v for all vectors v, then AB must be this matrix. That is, we must have 


aB= (3 e 


Problem 10.34: If AB = I, then we say that A is the inverse of B and that B is invertible. We often denote the 
inverse of a matrix B as B~. Let M = E A Find M”. 


6 7 


Solution for Problem 10.34: We seek the matrix such that 
k (G >) k F a 
c d) \6 7) \0 1/’ 
Expanding the product on the left, we have 
ca 6b 3a+ i) _ i i 
2c+6d 3c 7d) \0 1/' 
From the entries in the first rows, we have 2a + 6b = 1 and 3a +7b = 0. Solving this system gives a = —4 and b = 
From the entries in the second rows, we have 2c + 6d = 0 and 3c + 7d = 1. Solving this system gives c = 3 an 


d = —}. Checking, we find that 
e aa, (2 3) 7 ( 0 
3/2 1/2) \6 7) \0 ly 


Ata 


a 


so M” = za 3/4 J 


Cs lly 


3 


| Problem 10.35: Does the matrix M = é 6 


wa A have an inverse? If it does, then find it. If it doesn’t, explain 


| why not. 


4 is the inverse of M. Then we must have AM = I, or 


(e a) (% to) =(0 1): 


ie —6b 5a- e A li a 
3c-6d 5c-10d/) \0 1/° 
From the entries in the first rows, we have 3a — 6b = 1 and 5a — 10b = 0. The second equation gives a = 2b, but 


substituting this into the first equation gives 6b — 6b = 1. Uh-oh; this equation has no solution! Therefore, we 
cannot find a matrix A such that AM = I, which means that M does not have an inverse. 0 


Solution for Problem 10.35: Suppose A = b 


Expanding the product on the left gives 


Why doesn’t M have an inverse? Here’s a clue: 


| Problem 10.36: If A is invertible, then how are det(A) and det(A™’) related? i 


Solution for Problem 10.36: We have AA = I, so det(A™'A) = det(I) = (1)(1) — (0)(0) = 1. We also have 
det(A7!A) = det(A~') - det(A), so we must have det(A')- det(A) = 1. 0 


OF 
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3 5 
=p =Q 


det(M) = (3)(—10) — (—6)(5) = 0, 


we know that there can be no matrix M7 such that det(M~!) - det(M) = 1. 


Now we see why the matrix M = ( ) in Problem 10.35 has no inverse. Since 


Problem 10.37: Let A = e A . Under what conditions does A have an inverse? If A has an inverse, what is f 


that inverse in terms of a, b, c, and d? 


Solution for Problem 10.37: Suppose A is invertible. We let A™! = K a , so we must have 
( p gq (e _ ¢ a 
r s ) e dH TO 1 


ee Hele a 
ra+sc rb+sd;) \O 1/° 


Expanding the product on the left gives 


Equating the corresponding entries on the first rows gives the system pa + qc = 1, pb + qd = 0, and equating the 
corresponding entries on the second rows gives ra + sc = 0, rb + sd = 1. If ; i + 0, then applying Cramer’s rule 


to solve the system pa + qc = 1, pb + qd = 0 for p and q gives 


e la 1 
foa a P _ |b Oo] -b 
ae a c| la c E z a c|) lac ‘ 
b d b d b d b d 
Similarly, if : i + 0, then we can apply Cramer’s rule to ra + sc = 0, rb + sd = 1 to give 
0 c a 0 
1 d —¢ b 1 a 
r= = and s= = 
a c m @ a c a c 
b d b d b d b d 
Since 5 = ad — bc = det(A), the denominator in each of the results for p, q, r, and s equals det(A), and we have 
= íl d p 
Li 
es det(A) ie a j` 
Checking, we find that 


pane 1 iS ae E 1 Cae Be 1 ce 0 vale a 
~ det(A) \-c @/\c d/` det(A) \-ca+ac -cb+ad/~ det(A) 0 ad—be}~ \O 1)’ 


as expected. 


Our work above shows that we can find an inverse of any 2 x 2 matrix with a nonzero determinant. We say 
that such a matrix is invertible. Our solution also tells us that if A is invertible, then its inverse is unique. That is, 
if A is invertible, there is only one matrix A such that AA = 1. o 
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b 
Important: Let A = K “ik If det(A) = 0, then A does not have an inverse. Otherwise, we 
have | 
fd 
A7! = ( ) l 
T 
Problem 10.38: a a e e 
| (a) Find the inverse of G eT 


(b) Use your answer to part (a) to solve the system of equations 4x — 3y = —2,5x + 7y = —3. 


f , -0.1 5 
Find the inverse of E 4 a 


Use your answer to part (c) to solve the system of equations —0.1¢ + 5u = —3, —0.4t + 30u = 7. 


Solution for Problem 10.38: 


(a) The determinant of the matrix is (4)(7) — (—3)(5) = 43, so the inverse of the matrix is 
1 ( 7 3 s ( 5 d 
43 \-5 4 -5 Ż 
l : 4 =3\ (x 2 T : : à 
(b) We can write the system of equations as ( 5 7 ) (*) = (3) . Multiplying both sides of this equation by 


the inverse we found in part (a), we have 


& B\(4 -3\(*\_(® B)\(-2)_(-8 
BN 7 | es \ a eg eR a 
43 43 43 43 


The product of a matrix and its inverse is the identity, so we have @ = x = i 


(c) The determinant of the matrix is (—0.1)(30) — (5)(—0.4) = —1, so the inverse of the matrix is 
y Ae, T 
—1 \0.4 —0.1/ \-04 0.1/° 


(d) We write the system as av a A = =) . Multiplying both sides by the inverse we found in part 


Gin aes ol ceria) a 


: t 
As before, the product of a matrix and its inverse is the identity, so we have e) = F a , 


(c), we have 


-3 -5 3 
Problem 10.39: If A = 1 ) and B = = à 


Solution for Problem 10.39: We can’t divide by A, but because its determinant is nonzero, we can multiply by its 


1 [eS 0.1 03 7 : ; 
inverse. The determinant of the matrix is 10, so At= 10 E 3 = E 2 0 aj What’s wrong with this next 


step: 


a aaaaaaaaaaaaaaaalluliÅÂÃÂiIOOOOI 
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Bogus Solution: | Multiplying both ae of the given equation by A“ gives 1, ae 


The problem here is that we started with AM = B and multiplied AM on the left by A“! to get IM, but we 
multiplied B on the right by A~’. While we can go from t = s to rt = sr with real numbers, we can’t go from T = S 
to RT = SR with matrices, since matrix multiplication is not commutative. 


= IMBA". | 


We multiply both sides of the equation AM = B on the left by A™! to produce A™AM = A™B. The left side 
then becomes IM, which equals M, so we have 


m x) a E m 
bls A a oaia 2) aaa 


So, we see that there is a unique solution to the equation AM = B. Moreover, our solution shows us that whenever 
A is invertible, there is a unique solution for M to the equation AM = B, which is M = A™B. o 


Problem 10.40: Earlier we saw that AB is not necessarily equal to BA. Must AA equal A7'A? 


Solution for Problem 10.40: We have defined A`! as the matrix such that A™!A = I, so to check if AA“! = AT! A, we 
need only compute AA™. Letting A = (‘ Al , we have 


wl ) atin Daal CL Da wa) 


Therefore, AA™! = AIA = I. O 


We therefore see that if A is invertible, then the inverse of A7! is A. 


‘Important: If A is invertible, then we have AA? = A'A =I. 


Problem 10.41: Suppose A and B are invertible. Prove that (AB) = Be Ase 


Solution for Problem 10.41: If M is the inverse of AB, then we must have M(AB) = I. So, we test M = B'A™!, and 
using the associative property, we find 


(B-'A7!)(AB) = B-1(A71(AB)) = B7! ((A71A)B) = BIB = BB = I. 
Therefore, the inverse of AB is B'A7!. o 


10.6.1 What is the inverse of I? 


10.6.2 For each of the following matrices, find the inverse or show that the matrix does not have an inverse. 
6 8 2 -1 —40 30 
(a) E i) ey é a © ( 8 ef 
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10.6.3 Suppose we are given matrix A and vector v. 


(a) If Ais nonsingular, must there be a vector w such that Aw = v? 


(b) If A is singular, can we conclude that there is no vector such that Aw = v? 


10.6.4 All of the entries of M and M™ are integers. Find all possible values of det(M). 


10.6.5 In Problem 10.41, we showed that if A and B are invertible, then (AB)! = B-'A“!. If A and Bare invertible, 
then must we have (A + B)! = B~! + A712? 


10.6.6 In Problem 10.18, we found that multiplying G by the matrix ie ce) corresponds to multiplying the 


complex number x + yi by a + bi. Find the matrix M such that multiplying b by M corresponds to dividing 


x + yi by a + bi (assuming a and b are not both 0). 


10.7 Summary 


A 2 X 2 matrix represents a linear function that maps vectors in 2 dimensions to vectors in 2 dimensions. This 
mapping is typically illustrated as a product of the matrix and a vector according to the definition 


k 1) (*) 3 E + A 
c d) \y)  \cx+dy/` 
For any matrix A, any scalar c, and any vectors v and w, we have A(cv) = c(Av) and A(v + w) = Av + Aw. 


We add two matrices by adding the corresponding components in the matrices, and define the product of two 
matrices as follows: 
Ge be & A os Ce +anbzn anbi + a 
an an) \ba bn)  \azbn + ax2ba anbi + anban) ` 


Matrix multiplication is not commutative, but it is associative and distributive. 


If v and w are linearly independent and A and B are matrices such that Av = Bv 
and Aw = Bw, then A = B. 


reee e e 


TE 


is called the identity matrix. Multiplying a vector v by I leaves the vector 
unchanged; in other words, Iv = v for all vectors v. We also have IA = AI = A 
for any 2 x 2 matrix A. 


The matrix 


Any linear function that maps vectors to vectors can be expressed as multipli- 


W cation by a matrix, where the columns of the matrix are given by the vectors to 
which i and j are mapped by the function. 
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Definition: The determinant of the matrix k 5 is the quantity ad — bc. We often denote this determinant 


, SO 


. a 
as 
C 


d 
a b 
c d 


We also can denote the determinant of a matrix A as det(A). 


= ad — be. 


We say a matrix is singular if its determinant is 0, and nonsingular if its determinant is nonzero. If det(A) # 0, 
then for any vector c, there is unique vector v such that Av = c. 


The determinant has the following properties: 


a b+ka 
c d+kc 


a b 
c d 


atke b+kd 
c d 


a b 7 
c+ka d+kb| — 


a+kb b 
c+kd d 


e For any 2 x 2 matrix A and any constant k, we have det(kA) = k? det(A). 


Multiplying either column or either row of a matrix A by any constant k gives a matrix with determinant 
kdet(A). 


For any 2 x 2 matrix A, swapping the rows or columns of A produces a matrix with determinant (—1) det(A). 


e The determinant of a 2 x 2 matrix is 0 if and only if the rows of the matrix are linearly dependent. 


We have det(AB) = det(A) det(B) for any 2 x 2 matrices A and B. 


equals the area of the parallelogram that has as two sides the 


The absolute value of the determinant j ; 


vectors (9 and (5) drawn from the origin. 


If BA = I, then we say that B is the inverse of A and that A is invertible. We typically denote the inverse of A 
as A™!, and we have AA? =A7TA=I. 


Important: z Let A = (5 1) . If det(A) = 0, then A does not have at inverse. Otherwise, we 
have 


1 d -b 
-1 _ 
as det(A) f a ) i 


a OS ST eiii: 


REVIEW PROBLEMS pe 


10.42 Find constants a, b, and c such that ( A +c ( a ) = ( 


32 2 
Z 2 =ð =í 


= =o) © 


5 = 


2 3 
10.43 LetM = ic a and N = € 2 


) . Find MN and NM. 
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10.44 Show that the rows of a matrix are linearly dependent if and only if the columns of the matrix are linearly 
dependent. 


10.45 Is there a matrix F such that for all matrices A, the product FA is the matrix formed by swapping the 
columns of A? If such a matrix exists, what is AF for that matrix? 


10.46 Suppose Av = (3) and Aw = ( 4 


) . Find A(3v — 4w). 
10.47 


(a) Find two matrices besides 0 and the identity such that A? = A. 
(b) Find a matrix besides 0 and the identity such that A’ = A and A® + A’. 
10.48 
(a) Find a matrix A such that Av is the reflection of v over the y-axis. 
(b) Find a matrix B such that Bv is the reflection of v over the graph of x = y. 
10.49 What is the matrix R such that Rv is a clockwise rotation of v by an angle @ for all vectors v? 


cos -sin0 
sin cos@ 
of 20 and 390, respectively. 


10.50 LetT = ( ) . Compute T? and T° and confirm that these equal the rotation matrices for angles 


10.51 Find M” if M = (4 o 


vi v2 


10.52 Point P is reflected over the y-axis and the result is rotated 60° counterclockwise about the origin. The 
point that results is (—2 — 3, 1 — 2 V3). What are the coordinates of point P? 


10.53 Show that f(0) = 0 for any linear function f that maps vectors to vectors. 
10.54 Evaluate the following: 


7 A 


0.27 -175 
oR 


G ea 95 


(a) Must the product of two singular matrices be singular? 
(b) Must the product of two nonsingular matrices be nonsingular? 


(c) Is the product of a singular matrix and nonsingular matrix always singular, always nonsingular, or some- 
times singular and sometimes nonsingular? 


10.56 Suppose we form the first row of C by adding k times the second row of A to the first row of A, and we 
form the second row of C by adding j times the first row of A to the second row of A. Must the determinants of 
A and C be the same? 


10.57 Explain geometrically why swapping the columns of A produces a matrix with determinant (—1) det(A). 


10.58 For each of the following matrices, find the inverse, or show that the matrix does not have an inverse. 


=} ð 0 -3 6 12 
(a) S 3) (b) G 0 ©) (5 g 
10.59 Suppose that A™' = A. Find all possible values of det(A). 


o n r Im nn So 
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10.60 The transpose of a matrix 6 A is F ab We denote the transpose of A as A7. 


(a) Show that (A +B)" = AT + B7. 
(b) If Ais invertible, must A7 be invertible? 
(c) If A’ is invertible, must (A7)! equal (A~!)!? 


-1 4 © o 
your results, try to prove that what you noticed holds for any matrices R and S. 


(d) LetR = ( - 7 and S = & J Find RS, (RS)', R'S!, and STR". If you notice anything interesting in 


Challenge Problems 


10.61 Find two nonzero matrices A and B such that neither matrix is the identity, A + B, and AB = BA # 0. As 
an extra challenge, find a pair such that no entry in either matrix is 0. Hints: 118 


10.62 Suppose A®° = A. Find all possible values of det(A), assuming all the entries of A are real. What if we 
allow the entries of A to be nonreal? 


10.63 Find a matrix such that A‘ = A, but A? # A. Hints: 230, 202 
10.64 Let A and B be two 2 x 2 matrices. Prove that det(A + B) + det(A — B) = 2(det(A) + det(B)). 


Aa eh Yy- 
s= ee 


10.65 Explain why the line through the points (x1, y1) and (x2, y2) is the graph of the equation 
evi 
10.66* Let A= e >) : 


Fst Fy 
Fy el 


(b) Prove that Fy1F,-1 — F2 = (-1)" for all n > 0. Hints: 136 


(a) Prove that A” = ( ) , where F, denotes the n Fibonacci number. 


10.67 Describe all (x, y) such that there exists a value of 0 for which x cos 0 — ysin 0 = 3 and xsin 0 + ycos 0 = 2. 
Hints: 115 


10.68 
(a) Show that reflection over any line through the origin is a linear transformation. 


(b) Find the reflection of (4, —6) over the line y = 2x. 


(c) Let f be a function that maps a point (x, y) to the image of the reflection of (x, y) over the line x + y = 6. Is f 
a linear function? 


-2 6 
10.69 LetA= F Ar 


(a) Find all nonzero vectors v such that there exists a constant A for which Av = Av. 


(b) For each vector v and corresponding A you find in part (a), find det(A— AI). Explain your results. Hints: 213 


10.70 Suppose the columns of A are orthogonal. Find both a geometric and an algebraic proof that | det(A)| 
equals the product of the norms of the columns. 
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CHALLENGE PROBLEMS 


10.71x Suppose that the points (x1, y1), (X2, Y2), ---) (Xn, Yn) are the vertices of a convex x M 
polygon, in counterclockwise order. We can find the area of this polygon with the following Le 
process: X$ 


1. List the vertices in order, as shown in the middle two columns at right, and X21 pA NEN XPY2 
include (x1, y1) a second time at the end of the list. VEE a 
X32 : » X243 
2. Take pairwise diagonal products, as shown in the first and fourth columns at l i 


right. : Xn p 


3. Sum the first column and sum the fourth column. The area of the polygon then XnYn-1 `y Xn-1Yn 
is half the difference of the fourth column sum and the first column sum. y 
n 


x4 Xryı 


This fact is sometimes sometimes referred to Shoelace Theorem. 


(a) Prove the Shoelace Theorem for n = 3. 

(b) Prove the Shoelace Theorem for n = 4. Hints: 18 

(c) Prove the Shoelace Theorem for any integer n with n > 3. Hints: 287 
10.72 Back on page 367, we made the claim that the parallelogram with sides Ar and As has signed area det(A) 
times the signed area of the parallelogram with sides r and s, where the orientation is determined by traveling 
first from the origin along r. We proved this statement when r and s are i and j. In this problem, we'll prove it for 
any linearly independent pair of vectors r and s. 

(a) Why must there exist a matrix B such that Bi = r and Bj = s? 

(b) In terms of A and B, what is the signed area of the parallelogram with sides r and s? 


(c) In terms of A and B, what is the signed area of the parallelogram with sides Ar and As? Note that you 
cannot simply multiply your answer from part (b) by det(A); we are trying to prove such a multiplication 
works! 


(d) Complete the proof by showing that the parallelogram with sides Ar and As has signed area det(A) times 
- the signed area of the parallelogram with sides r and s. 


10.73 For any vector v, the product Tv is the reflection of v over the line y = 3x. Find det(T). Hints: 144 
10.74 Find all matrices A such that A? = I. Hints: 75, 116 

10.75 Isit true that A? — B? = (A — B)(A + B) for all matrices A and B? 

10.76% Let A and B be matrices such that A + B = AB. Show that AB = BA. Hints: 222, 277, 54 


4 


10.77x Let A be a2 x 2 matrix such that A (3) = (5 


) and A? — A + 5I = 0. Find A. Hints: 119 


10.78 
(a) If A? = 0, then must A = 0? 
(b)* If A? = 0, then must A? = 0? Hints: 221 
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CHAPTER 11. VECTORS AND MATRICES IN THREE DIMENSIONS, PART 1 


Every new body of discovery is mathematical in form, because there is no other guidance we can have. - Charles Darwin 


nad i 
= 


Vectors and Matrices in Three Dimensions, Part 1 


11.1 Vectors in Three Dimensions 


Vectors in three dimensions have much in common with vectors in two dimensions. Just as we denote a vector in 

a + 

two dimensions with an ordered pair G , we denote a vector in three dimensions with an ordered triple | b 
@ 


We define vector addition and multiplication by a scalar for vectors in three dimensions in essentially the same 
way we did in two dimensions: 


e Vector addition. We add vectors by adding the corresponding components: 


c oug; 


e Multiplication of a vector by a scalar. We multiply a vector by a scalar by multiplying each component of 
the vector by the scalar: 


We also define the dot product of two vectors in three dimensions much as we defined it for vectors in two 
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dimensions: 
Vi wi 
U2 |:| Wo | = 01: W1 + V2: W2 + 03° w3. 
03 W3 


X 
Just as we can represent the vector (*) as an arrow from the origin in the Cartesian plane to the point (x, y), 


x 


we can represent the vector s) as an arrow from the origin to the point (x, y, z) in space. With this in mind, 
Z 


x 
we can extend the concept of the norm of a vector to vectors in three dimensions. We define the norm of s) 

Z 
as the distance between (x, y, z) and the origin. We showed on page 178 that the distance between (x1, y1, z1) 


and (x2, y2, Z2) in space is y (x2 — x1)? + (y2 — 1)? + (z2 — z1)?. Therefore, the distance from (x, y, z) to the origin is 


yX + y* + 22. As in two dimensions, we denote the norm of v as ||v||, so we have 


x 
(:) = P d YP A 


Z 
Also as in two dimensions, the norm of a vector is sometimes referred to as the vector’s length or magnitude. 


a ba 
A vector with magnitude 1 is called a unit vector. The vector | b } is also written 
c 
ai + bj + ck, where i, j, and k are unit vectors from the origin in the direction of the 
positive x-axis, y-axis, and z-axis, respectively. Just as with vectors in two dimensions, 
a nonzero vector in three dimensions can be written as the product of its magnitude 
and a unit vector that is sometimes referred to as the direction of the vector. 


|, Probiems ig 


Problem 11.1: Let u = ( 


(a) Findu+v. 
(b) Find 3u — 5v. 


(c) Find x and y if 2w — u = tv for some constant t. 


2 —1 
Problem 11.2: Evaluate | -3 |-| 3 | and (—5i+3j — 2k) - (5i — 4j + 0k). 
4 5 


Problem 11.3: Show that for any scalar c and any vectors u, v, and w, we have: 
(a) v-w=w-v 


(b) v- (cw) = c(v- w) 


(c) u-(v+w)=u-vtu-'w 
v 


2 
-v = Iivil 
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| Problem 11.4: Show that if v and w are nonzero vectors in three dimensions, then v + w = ||vj|i|wl] cos @, where 
8 is the angle between v and w when these vectors are drawn from the origin in space. 


We start with a little practice adding and subtracting vectors in three dimensions. 


ae 1 0 /x 
Problem 11.1: Letu={ 3 ],v= { -4 |,andwe= |{ 2 }. 
=5 5 y 


(a) Findu+v. 
| (b) Find 3u — 5v. 
(c) Find xand yif2w-u= 


Solution for Problem 11.1: 


îl 0 ` 1+0 iL 
(a) Wehave| 3 |+| -4]/=[ 3-4 | ={ -1 ]. 
-5 5 -5+5 0 
1 0 3 0 3 
(b) Wehave3| 3 |] -5| —-4]={ 9 |-| -20]={ 29 }. 
—5 5 -15 25 —40 

x 1 2x-1 0 0 
(c) Since 2 — eo) || = 1 and t| —4 | = | —4t |, we must have 2x + 1 = 0, 1 = —4t, and 

y =) 2y +5 5 5t 


2y +5 = 5t. From the first equation, we have x = —4. From the second equation, we have t = —4, and 


substituting this value of t into the last equation gives y = -3, 


Solution for Problem 11.2: Applying the definition for the dot product from the introduction, we have 


2 si 
(3) í 3 ) = (2)(-1) + (-3)(3) + (4)(5) = -2 -9 +20 = 9 
4 5 


(-5i + 3j — 2k) - (5i — 4j + Ok) = (-5)(5) + (3)(-4) + (-2)(0) = -37. 


and 


Problem 11.3: Show that for any scalar c and any vectors u, v, and w, we have: 
(a) vw=wv 


(b) 


- (cw) =c(v: w) 


v 
(c) u- (v+w)=u: v+u: w 
v 


2 
v= [lvl 
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Solution for Problem 11.3: In each of the following parts, we let u = mi + u2j + u3k, v = vi + Voj + vsk, and 
w = wii + wj + wk. 


(a) We have v -w = vw + VW + V3W3 = WV] + W202 + W303 EW: V. 


(b) We have 
Vi CW, 
v- (cw) = | v2 |: | cw2 | = (v1)(cw1) + (¥2)(cw2) + (v3)(cw3) = c(viw + vzw + 0303) = c(v : w). 
03 CW3 
(c) We have 


u: (v + w) = uı(vı + w) + U2(V2 + w2) + U3(U3 3P w3) 
= (U1U1 + U2U2 + U3U3) + (U1W1 + U2W2 + U3W3) 
=u: v+u: wW. 
(d) We have v-v =v +0 +v = |v]. 
=) 


So, we see that the basic properties of the dot product that we showed for two dimensions also hold for three 
dimensions. 


Important: For any scalar c and any vectors u, v, and w, we have: 


vV'-w=w-v 
-(cw) = c(v- w) 
‘(V+tw)=u-v+u-w 


2 
-v = Ilvil 


Combining properties (a), (b), and (c) above tells us that the dot product of vectors in three dimensions is 
bilinear, just like the dot product of vectors in two dimensions. By this, we mean that for any vector v, the 
functions f (w) = v- wand f(w) = w - v are both linear functions. 


We also discovered a nice geometric interpretation of the dot product of vectors in two dimensions. Let’s see 
if this interpretation holds for vectors in three dimensions, as well. 


Problem 11.4: Show that if v and w are nonzero vectors in three dimensions, then v- w = ||{v|| Iwi] cos @, where 


O is the angle between v and w when these vectors are drawn from the origin in space. 


Solution for Problem 11.4: If we look back to our proof for vectors in two dimensions in Problem 9.9 on page 309, 
we have a pleasant surprise: the same proof holds in three dimensions! 


At right, we have points V and W, where V =vand W = w, and we let the origin be O. V 
We therefore have OV = ||v||, OW = ||w||, and WV = [wv || = jjv — wl]. 7 
The Law of Cosines gives us WV? = OV? + OW? — 2(OV)(OW) cos 0, or 
Iv — wl? = Iivi? + liwii? — 2 |Ivilllwil cos 8. 


Since ||v||* = v- v, Iwl? = w - w, and |lv — wil? = (v - w) - (v — w), we can write the Law of 


Cosines equation above as 
(v—w):(v-—w) =v-v+w-w-— 2|lv|| |lwl|cos 8. 
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Expanding the dot product on the left gives 
v-v—-2v-wt+w-w=v-v+w-w-2|lv|lllwi|cos@ 
and simplifying this equation gives the familiar 


v- w =||lv||[lwilcos 8. 


If you look back to our proof for vectors in two dimensions on page 309, you'll see that it looks very similar to 
our proof above! 


‘Important: If Ø is the angle between v and w, then 


Vw = |lvll{lwllcos 8. | 

As in two dimensions, we can use this relationship to determine the angle between two vectors, and if two 
vectors have dot product 0, then we say they are orthogonal. If orthogonal vectors are nonzero, then their 
geometric representations are perpendicular, since v - w = 0 for nonzero v and w if and only if the cosine of the 
angle between them is 0. 


11.1.1 Letu = 4i + 2j -kand v = —i + 5j + 2k. 


(a) Find w ifu -3w = 2v. 
(b) Do there exist nonzero scalars a and b such that au + bv = 0? 


(c) Does there exist a vector r such that there is no pair of scalars (x, y) for which xu + yv = r? 


1 3 —4 
11.1.2 For what values of x and y is (=) orthogonal to both (=) and ( 1 ) ? 
y 4 2 


/3 = 
11.1.3 Find the angle between 9 and ( 4 to the nearest degree. 
2 7 


11.1.4 Suppose the point (x, y, z) in space has spherical coordinates (p, 0, œ), where p > 0. If u = xi + yj + zk, then 
express @ in terms of u, i, j, and k. 


11.1.5 Letv = xi + yj +zk and w =i- 2j + 4k. 


(a) Describe the graph of the points (x, y, z) such that ||v|| = 4. 
(b) Describe the graph of the points (x, y, z) such that ||v — w|| = 6. 
(c)x Describe the graph of the points (x, y, z) such that 2v - w = ||v|| Ilw] 


11.2 3x3 Matrices 


Just as we used a 2 x 2 matrix to help represent a system of two linear equations in two variables, we can use a 
3 x 3 matrix to help represent a system of three linear equations in three variables. Specifically, we can represent 
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i G RE 


the system 


411X + A124 + 413Z = by, 
anx + A22Y + 4232 = by, 
431X + 432Y + 4332 = b3, 


Ai i2 443 xX by 
an ax a3 y |=| b 
A31 A32 433 z b3 


The product of the 3 x 3 matrix and the vector on the left is defined as 


M1, 2 M3 x 411X + A124 + 413Z 
421 A22 493 y | = | @aiX + Any + 493Z 
A31 432 433 Z 431X + A32Y + A33Z 


Just as a 2 x 2 matrix represents a function that maps vectors in two dimensions to vectors in two dimensions, 
a 3 x 3 matrix represents a function that maps vectors in three dimensions to vectors in three dimensions. We say 
that such a function has domain R°. 


Also as with 2 x 2 matrices, we typically use a capital letter to denote a 3 x3 matrix, and often use the lowercase 
of the letter together with subscripts to denote individual entries of the matrix. The first subscript refers to the 
row of the entry and the second refers to the column. So, for example, s13 refers to the entry of matrix S in the first 
row of the third column. 


Adding two 3 x 3 matrices and multiplying a 3 x 3 matrix by a scalar work essentially the same way they do 
for 2 x 2 matrices: 


a, a2 M3 bu bu b3 an +bu a+b 3+ b3 
an 422 423 |+| bn bo bz |= | an +bn an +b» a3 + bo 
43, 432 433 b31 b32 b33 43) + b31 a32 +b32 a33 + b33 

a. M2 M3 kay, kan kmg 

k| an an 423 |= | kan kan kaz 

431 432 433 kazı ka32 kaz3 


e ila 


Problem 11.5: Compute the following products: 


ee 4970 

Eis 0 5 || 3 
So) 9 ONG 
09 5 2\ /-3 
3 | (2 
Bere) \ 6 


Problem 11.6: Show that A(kv) = k(Av) and A(v + w) = Av + Aw for any 3 x 3 matrix A, any scalar k, and any 
vectors v and w in three dimensions. 
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Problem 11.7: In this problem, we find the product of two 3 x 3 matrices, Consider the system of equations 


3x + Sy 
y—3z=4, 
—2x + 4y +z =r. 


Suppose we also have x = 3t — 4u — v, y = —t + v, and z = 7t — 3u — v. 


x 
(a) Find matrix A such that the original system of equations can be expressed as A (z) = (5) ; 


x p^ 
(b) Find matrix B such that (2) =B (*) : 
z v 


SS 


I ae t 
(c) Suppose we wish to define the product AB such that (AB) («) =A ( (=) for all («| . Find the 
v v v 


\ 
ayy a2 M bu bu by 
Az 2 as ba bu bz]. 
43; 32 as) \ba bs2 ba 


7 


matrix that equals AB. 


Problem 11.8: Find the product 


Find the following products: 


Problem 11.9: 


=l A S 
S © =l 
i TES 


Problem 11.11: If A and B are 3 x 3 matrices such that Ai = Bi, Aj = Bj, and Ak = Bk, then must we have 
A = B? 


Problem 11.10: If A and B are 3 x 3 matrices, must we have AB = BA? 


Problem 11.12: 
(a) Find a3 x3 matrix I such that IA = A for any 3 x 3 matrix A. 


(b) Do we also have AI = A for this matrix I? 


(c) Is the matrix you found in part (a) the only matrix I such that IA = A for all A? Is it the only matrix such 
that AI = A for all A? 


Problem 11.13: 
(a) Find the matrix P such that for every A, the product PA results in a matrix whose third row is the same 
i as that of A, but the first two rows are swapped. (That is, the second row of the product is the first row 
of A and vice-versa.) 


(b) Find a matrix T such that T + I and T? = 1. 


11.2. 3x3 MATRICES 


Problem 11.5: Compute the following products: 


G 


Solution for Problem 11.5: 


(a) Applying the definition of multiplication of a vector by a matrix, we have 


a 7G (1)(0) + (3)(3) + (-4)(5) =11 
3 0 5 |{3)=( (-3)(0)4+()3)+6)5) | =| 25 }. 
L 1 Sea: (—6)(0) + (1)(3) + (-2)(5) = 


Qs =3 (0)(—3) + (—5)(2) + (2)(6) 2 
= 2 \=[ (-1)-3)+(1)2) + (-3)(6) | = [ -13 }. 
roe ee] 6 (5)(—3) + (—2)(2) + (-1)(6) -25 


(b) We have 


Solution for Problem 11.6: We have 


a 42 M3 kv ayykvy + ay2kv2 + akus A1101 + 41202 + 44303 
A(kv) = | an an az kv | = | anku + aogkv2 + do3kv3 | =k | aņvı + a02 +az3v3 | = k(Av) 


a3, 432 433 kv3 a3\kv1 + a32kv2 + 433kv3 A3101 + 3202 + 43303 
and 
An 2 An V1 + Wy ay (vı + w1) + a12(V2 + w2) + a13(03 + w3) 
A(v+w)= | an an 433 V2 + Wz | = | an (vı + w1) + az2(v2 + w2) + a23(03 + w3) 
A3) 432 433 v3 + W3 azı (V1 + w1) + a32(V2 + w2) + a33(V3 + w3) 
44101 + 44202 + 41303 441 W, + 422 + 4433 
= A2101 + 42202 + 42303 + | anw + d22W2 + A233 
43101 + 43202 + 43303 431W1 + 432W2 + 433W3 
= Av + Aw. 
g 


| Important: For any 3X3 matrix A, any scalar k, and any vectors v and win three dimensions, 


we have A(kv) = kAv and A(v + w) = Av + Aw. 


So, we see that the function that a 3 x 3 matrix represents is a linear function. 
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Problem 11.7: In this problem, we find the product of two 3 x 3 matrices. Consider the system of equations : 


ox + by = 22 — 9, 
y—32=4, 
-2x + 4y +z =fr. 


Suppose we also have x = 3t — 4u — v, y = —t + v, and z = 7t — 3u — v. 


cf P 
(a) Find matrix A such that the original system of equations can be expressed as A (3) = (a) : 


xX t 
(b) Find matrix B such that | y | =B| u |]. 
z v 
t t t 
(c) Suppose we wish to define the product AB such that (AB) | u | =A| B| u for all | u |. Find 
v v v 


the matrix that equals AB. 


Solution for Problem 11.7: 


(a) Following the definition of a 3 x 3 matrix in the introduction, we can write the initial system of equations as 


TREN 


(b) Again using our definition of a 3x3 matrix, we can express the system x = 3f—4u—v, y = —t+v,z = 7t-3u-v 


OGD 


(c) We'll take two different approaches to find AB. 
xX 


Method 1. If we substitute the expression we found for @ in part (b) into the equation we found in 
Z 


TEN a a a p 
o a Oo | uj=(q). EEN 
=) 4 i eae 1) Nee r 


Equivalently, we can substitute the given equations x = 3t — 4u —v, y = —t +v, z = 7t —3u —v into the original 
system of equations to get 


part (a), we have 


3(3t — 4u — v) + 5(—t + v) 271 — 3u - v) = p, 
(is ©) = S7 = 3 = O) = ih 
—2(3t — 4u — v) + 4(—t + v) + (7t — 3u —- v) =r. 
Simplifying these equations gives 
—10t - 6u + 4v =p, 
—22t + 9u + 4v =q, 
-3t + 5u + 5v =r, 


388 


11.2. 3x3 MATRICES 
which we can write as 


-10 -6 4 t p 
-22 9 4 “l= (eal ee 
-3 5 5 v r 


Comparing this to Equation (11.1), we have 


ð ð Sk 3 = =l t -10 -6 4 t 
© l =3 = U l u |=| -2 9 4 ul, (11.2) 
-2 4 1 7 = =Í v = 5 5 vj 


from which we conclude that 


5 5 2 ð 4 =í -10 -6 4 
0 1 -3 -1 0 1 J]J={|-2 9 4}. 
—2 4 #1 7 =p =Í = ð J; 
t 
Method 2. We use the definition of the product of a matrix and a vector to find AB. SinceB | u | = | yl, 
v 
t EN x 
(AB); u | =A Bl u =A|y}. 
v v z 


7 
x t 
In order to find the matrix that equals AB, we must express A | y | as a single matrix times | u |. We 
z v 


we have 


AAN 


x 
know how to multiply a vector by a matrix, so we express [y in terms of vectors. From the system 
Z 


x = 3t — 4u — v, y = —t + v, z = 7t — 3u — v, we have: 


ORORO 


x 
Substituting this into A ( gives 
z 


3 & 2 x 
8 y= 
-2 4 1 Z 


I 
oye 
ee. 24 

a 
aan 
aod 
ee 

= 

+ 
A T 
aoe Ae 
eu 

ie 


—10t — 6u + 4v 
—22t + 9u + 4v 
—3t + 5u + 5v 
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t x -10 -6 4\ /t 
v % -3 5 Uy 


5 
3 5 2 3 —4 -1 -10 -6 4 
from which we conclude that AB= | 0 1 -3 -1 0 1 =| -2 9 4). 
—3 -1 


-2 4 1 7 = 5 © 


Therefore, we have 


We're now ready to find a method to multiply any two 3 x 3 matrices. 


ere 
ayy a2 43 by biz bis 
Az, A22 A23 bo1 bz bz |} 
a31 A32 33 b31 b32 b33 


A 


Problem 11.8: Find the product 


Solution for Problem 11.8: We use part (c) of the previous problem as a guide. Let the matrices be A and B; we seek 
a matrix that equals the product AB, where 


a a 
(AB) | u |] =A] Bi u (11.3) 
V Uy a 

t 
for all | u |. We have 

v 
t bu biz bis t byt + bzu + bigu by biz bi 
B| u |=| by bo bz u | = | but + dou + bv | = | ba | t+ | boo | u+ {| bz |v. 
v bs b3 b33 v bait + bazu + b330 , b31 b32 b33 


Substituting this into the right side of Equation (11.3) gives 
t Ay 12 M3 by by. bis 
Al Bi u = į A7) ar 423 by | E+ bz ut | b3 |v 
v a31 A32 433 b31 b32 b33 
Ay, a2 M3 by ayy a2 413 biz ay, M2 433 biz 
= | 41 Ax 423 bo | t+ | an G22 a3 bon | u+ | a2 a2 Ar bo3 |v 
431 432 433 b31 431 32 433 b32 431 432 433 b33 


ayibi + 412b21 + 413531 411012 + 412b22 + a13b32 41113 + 412b23 + A13b33 

21041 + A22b21 + A23b31 | t+ | az1b12 + a22b22 + a23b32 | u + | A21b13 + a22b23 + a23b33 | v 
3104, + 432b21 + a33b31 431012 + A32b22 + A33b32 31043 + a32b23 + a3gba3 

(ma + a12b21 + 41303) 4đ11b12 + a12b22 + 413632 a1 b43 + a12b23 + cho (: 


az1b11 + 422b21 + a23b31 Ag by + a22b22 + a23b32 2113 + A22b23 + a23b33 u 
431011 + 432b21 + a33b31 3112 + a32b22 + a33b32 a31b13 + a32b23 + a33b33 v 


Comparing this to Equation (11.3), we have the desired definition of the multiplication of two 3 x 3 matrices: 


ayy M12 413 bu by bis anbi + ay2b21 + 443631 41012 + a12b22 + 413032 a11b13 + a42b23 + a13b33 
Az, An A23 by, buz bog | = | anbu +azbz +agb31 azxb12 + a22b72 + 493032 azxnbız + az2b23 + a23b33 


A31 A32 A33 b31 b32 b33 A31b11 + 432b21 + a33b31 431012 + a32b22 + a33b32 a31b13 + a32b23 + a33b33 
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Just as with the product of two 2 x 2 matrices, this definition is much easier to remember than it looks. We can 
view each row of a matrix as a vector, and each column of a matrix as a vector. The entry in row i and column j of 
the product AB is the dot product of the i row of A and the j® column of B. For example, the entry in row 2 and 
column 3 of AB is 


bis 
(an Ar? a23) bz | = anbis + az2b23 + a23b33. 
b33 


If we let the rows of A be the vectors ay, ay, and a3, and the columns of B be the vectors bj, bz, and b3, we can 
therefore express the product AB as follows: 


— a — a aa oe aytb a:b? a -bs 
AB = sap = b; b> bs = a? ` b; a2: b> a2- b3 E 
za ee | a3-b; a:b a3-b3 
We can also remember the entry in row i and column j of AB as the sum aj), j + Gi2b2; + ai3b3;. That is, this entry 


equals the sum of the three terms of the form Aixbj;, for k = 1, 2, and 3. 


Finally, we can also remember this matrix product with the same method we used to find it. Each column of 
AB equals A times the corresponding column of B: 


k ale al | | | 
AB=A (e b2 s) = (a Ab» a 
Po 4 | | | 


Problem 11.9: Find the following products: 
3 -1 4 -1 4 3 
(b) 0 1 7 3 5 =í a 
NS) 1 saline 


2 80 72 Se 
(a) =| 25 || 4 1 =2 
25 i ‘ 


Solution for Problem 11.9: We apply the rule for matrix multiplication we just learned to get the products below. 


(2)(4) + (—3)(-1) + (0)(5)  (2X(=1) + (-3)) + (0)X(0)  (2)2) + (—3)(-2) + (0)(-2) i a 
(a) { (-1)(4)+ E A (-1)(-1) + O (—1)(2) + (-2)(-2) + (5)(-2) | = | 23-1 -8 
(—2)(4) + (-5)(-1) + (IY) (—2)(-1) + (-5)) + DO) (2)(2) + (-5)(-2) + ()(-2) 2-3 4 


[sich 019-09 (3)(4) + (DS) + (41) OSTEO ) (3 del 3 
(b) z 


O 


(0)(—1) + (1)(3) A TT (0)(4) + (1)(5) + (MA)_——(0)(B) + (1)(“1) + (78) Lo es 
(2)(-1) + (-1)(3) + 3)) (4) + 16) + BQ) (2)(3) + E11) + BJO) —2 6 22 


(m 


Problem 11.10: If A and B are 3 x 3 matrices, must we have AB = BA? 


Solution for Problem 11.10: We test if AB and BA must be equal by reversing the order of the matrices in the first 
product we computed in the previous problem. So, we will compare the product 


4 -1 2 23 0 
= l =z = 2 5 
5 0 =2 =2 = í 
to the product we found in part (a) of Problem 11.9. Applying the definition of matrix multiplication, we find that 
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this product equals 


(4)(2) + (-1)(-1) + (2)(-2)  (®)(-3) + (-1)(-2) + 2)(-5)  (4)(0) + (=1)(5) + (2)) 5 -20 -3 
(—1)(2) + (1)(—1) + (-2)(-2) (—1)(-3) + (1)(-2) + (-2)(—5) (—1)(0) + (1)5) + (—2)(1) 1 3 
(5)(2) + (0)(-1) + (-2)(-2)  (5)X(-3) + (0)(-2) + (-2)(-5)  (5)(0) + (0)(5) + (-2)(1) 14 -5 -2 


This does not match the result we found in part (a) of Problem 11.9, so we conclude that AB and BA are not 
necessarily equal. O 


 WARNING!! Matrix multiplication is not commutative. That is, if A and B are matrices, 
“ then the products AB and BA are not necessarily equal. 


While 3 x 3 matrix multiplication is not commutative, it is associative and distributive. 


Important: For any 3 x3 matrices A, B, and Cc we have 


(AB)C = A(BC), 
A(B + C) = AB + AC, 
(A + B)C = AC + BC. 


Because matrix multiplication is associative, we can write a product such as A(BC) as simply ABC. 


The algebraic proofs of these properties are cumbersome and not terribly illuminating, so we won't include 
them here. 


Sidenote: Justas with2x2 matrices, we can see that 3x3 matrix multiplication is not commu- 
ay tative, but is associative, by thinking of matrix multiplication as the gempositiog 
of the functions represented by the matrices. 


Problem 11.11: If A and B are 3 x 3 matrices such that Ai = Bi, Aj = Bj, and Ak = Bk, then must A = B? 


My. a2 43 1 ayy 
Ay 22 a3 Oy = aa |. 
431 932 433 0 a3) 


Therefore, Ai equals the first column of A. The equation Ai = Bi then tells us that the first column of A is the 
same as the first column of B. Similarly, Aj equals the second column of A and Ak equals the third column of A, 
so Aj = Bj and Ak = Bk together tell us that the last two columns of A match the last two columns of B. Since 
each column of A matches the corresponding column of B, we have A = B. O 


Solution for Problem 11.11: We have 


Our work in Problem 11.11 gives us another way to think about matrices. The first column of a matrix A tells 
us what Ai is, the second column tells us what Aj is, and the third column tells us what Ak is. Let’s put this 
observation to work. 

Problem 11.12: 
= (a) Find a3 x3 matrix I such that IA = A for any 3 x 3 matrix A. 


(b) Do we also have AI = A for this matrix I? 


(c) Is the matrix you found in part (a) the only matrix I such that IA = A for all A? Is it the only matrix 
such that AI = A for all A? 
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Solution for Problem 11.12: 


(a) The first column of IA equals I times the first column of A. Since IA = A, the product of I and the first 
column of A must therefore be the first column of A. That is, we must have Iv = v for any vector v. Now, 
we can find I by letting v be each of i, j, and k. Since Ii = i, the first column of I is i. Similarly, since Ij = j 

and Ik = k, the second and third columns of I are jand k. 


So, if there is a matrix I such that IA = A for all A, then it must be the matrix whose columns are i, j, 
and k, in that order. Checking, we see that we do indeed have 


1 00 an An 443 M1 42 443 
0 1 0 an ao a3 |=| an az 423 |, 
0 0 1 A31 432 433 A31 432 433 


so IA = A for any 3 x 3 matrix A. 


(b) We have 
an 412 443 1 0 0 ay 42 443 
(= Ar 3 l 1 7 = (= a22 3 7 
A31 432 433 0 0 1 431 432 433 


so the matrix I we found in part (a) satisfies IA = A for any A. 


(c) Our work in part (a) shows that if IA = A for all A, then the columns of I must be i, j, and k, in that order. 
So, we have shown that this matrix I is unique. 


To see that this matrix is also the only matrix I such that AI = A for all A, suppose we have some matrix 
J such that AJ = A for all A. Now, we let A = I, the matrix we found in part (a). Then, we have IJ = I. But 
we have IJ = J because IA = A for any A, so we must have J = I. Therefore, the matrix I that we found in 
part (a) is the unique matrix such that AI = A for all A. 


Important: The 3 x 3 matrix I such that AI = IA = A for any 3x3matrixAis 
VY 


0 0 
I={0 1 0 
0 0 1 


We call this the identity matrix. 


Problem 11.13: | 
(a) Find the matrix P such that for every A, the product PA results in a matrix whose third row is the same f 


as that of A, but the first two rows are swapped. (That is, the second row of the product is the first row § 
of A and vice-versa.) 


Find a matrix T such that T + I and T? = I. 


b 


Solution for Problem 11.13: 


(a) We find P the same way we found the identity matrix, by finding Pi, Pj, and Pk. Since each column of PA 
equals P times the corresponding column of A, multiplying a vector by P exchanges the first and second 
rows of the vector but leaves the last row unchanged. Exchanging the first and second rows of i gives j, 
so Pi = j, which means the first column of P is j. Exchanging first and second rows of j gives i, so Pj = i, 
which means the second column of P is i. Finally, Pk = k, so the third column of P is k. Checking, we see 
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0 1 0 A1 412 413 Ax, an 423 
L O Q an ay 43 | =| 41 an mgl. 
0 0 1 431 432 433 431 432 433 


ue 


that we do indeed have 


(b) We could write out a 3 x 3 matrix for T, cube it, and then compare it to I, but that looks pretty terrifying. 
Trial-and-error doesn’t seem like fun either—who wants to cube random 3 x 3 matrices?!? 

Instead, we use part (a) as inspiration. There, we found a matrix P such that the product PA results in 
two rows of A being swapped. So, consider the product PPA. The product PA is just A with the first two 
rows swapped. Multiplying again by P to produce P*A simply swaps those two rows again, and gives us 
A back! So, P?A = A for any A, which means P? = I (as we showed in Problem 11.12). 

But we want T° = I, not T? = I. Rather than swapping two rows, what if we cycled them? In other 
words, perhaps we can find a matrix T such that multiplying by T to get TA moves the first two rows of 
A down one row and moves the bottom row to the top. In other words, we want the matrix T such that 
Ti = j, Tj = k, and Tk =i. These results tell us that the columns of T are, in order, j, k, and i. Checking, we 


see that we have 
0 0 1 ay. a2 m3 431 432 433 
100 Ay a2 23 | =| an an 413 |. 
0 10 431 432 @33 / Ay, An a3 


7 


Multiplying a matrix A by T moves each row of A down one row while the last row of A becomes the 
first row of the product TA. Performing this operation three times gives us the original matrix back, which 
means that T°A = A. So, we have found a matrix T such that T° is the identity. Let’s check: 


00 1 0 0 1 0 0 D 0 0 1 0 1 0 100 
1 0 0 1 0 0 100 =| i © 0 Lja l Oll 
010 0 10 010 0 1 0/\100 0 0 1 


The two matrices we found in Problem 11.13 are called permutation matrices. A permutation matrix is a 
matrix P such that for any matrix A, the rows of the product PA are just a reordering of the rows of A (or exactly 
the same ordering—the identity matrix is a permutation matrix, too). 


= > 


11.2.1 Find the following products: 


i ee l o” 2V 0-0 
Gere E | 4 (Oe es 
=a eG es Sea. 


=] = 6 Q =l g 
11.2.2 Compute the product | -2 4 0 l0 =æ =h 


=l =2 0 4 6 05 


Success! 


O 


11.2.3 Let k be a nonzero constant. Find a matrix M such that MA = kA for any matrix A 
11.2.4 Is the matrix T we found in part (b) of Problem 11.13 the only matrix T such that T + I and T° = I? 
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11.3 Determinants of 3 x 3 Matrices 


We introduced the determinant of a 2 x 2 matrix by solving a system of two linear equations in two variables. So, 
you might not be surprised that we introduce the determinant of a 3 x 3 matrix by solving a system of three linear 
equations in three variables. 


Problem 11.14: Consider the system of equations 


ax + biy + cz = di, 
ax + boy + coz = do, 
| a3x + b3y + c3z = ds, 
| 


ar by 
a3 bz 


; b 
where the 4;, b;, ci, and d; are constants. Show that if a; 


5 # 0, then 
bs 


Solution for Problem 11.14: We start by eliminating z from the second and third equations. Subtracting c2 times the 
first equation from c; times the second equation and subtracting c3 times the first equation from c; times the third 
gives the system 


(a2c1 — a1c2)x + (boc, — b1c2)y = doc, — dica, 
(a3c1 — a1C3)x + (b3c1 — bıc3)y = d3c1 — dıc3. 


aot, — ayC2 bocy = bic 


+ 0. Let's take a closer look 
a3C, ~ a1C3 b3C1 = bic3 


We can use Cramer’s rule to solve for x and y if and only if 
at this determinant. We have 


A2Cy —A1C2 boc, — by C2 


= (a0C1 — a1C2)(b3c1 — bıc3) — (boc, — b1C2)(a3c, — a163). 
eee — bie: (a2c1 — a C2)(b3c1 — bıc3) — (baci — b1C2)(a3c1 — a1c3) 


When we expand the products, the @,b;c2c3 terms cancel. All remaining terms have at least one factor of cy. 
Factoring cı out of everything and grouping the remaining terms by factors a1, b1, and c1, we have 


aoc, — A1C2 boc} = bicz 


E, b3C1 7 bic = C1 [ay (b2c3 = b3c2) + bi (c243 = C342) ar en (b3a2 = bza3)]. 


We recognize the expressions in the inner parentheses as determinants of 2 x 2 matrices, and we have 


= - bo c ay c a b 
aa o boci — byC2| _ 7 (a a |e Nez 2 ) l (11.4) 
a3Cı — a13 b3c, — bic3 b3 c3 a3 C3 az b3 
f ; ; ; dy C2 C2 2 j ; : 
(We'll explain why we write the middle term as =b; dane rather than b; ae later in this section.) 


We need the determinant on the left side of Equation (11.4) to be nonzero in order to be able to apply Cramer’s 
rule to the system 


(ancy — a4C2)x + (bec, — byc2)y = dzc — dic2, 
(a3cy = a1C3)X ar (b3c1 = bic3)y = d3C 1 = dıc3. 
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We are given that the expression in parentheses on line (11.4) is nonzero. But we'll also need cı + 0 in order to 
apply Cramer’s rule. We’ll tackle the case in which c1 # 0 now, and defer the case in which c1 = 0 to Problem 11.18. 


WARNING! We have to be very careful about 0, and often have to deal with 0 as a separate 


case when constructing proofs and solving problems. 


Assuming that c # 0, the determinant on line (11.4) is nonzero, and we can apply Cramer’s rule to find 


dcı = dicz bzc1 = bicz 
d3c1 = dıc3 b3c1 az: bic3 


acı — AC. bc, — bıc2 
43C1 —41C3 bacı — byc3 


Following the same expansion, cancellation, and factorization steps as above, we find that 


d bo C2 = dz C2 Tr dz bo 
_ cldi(bzcs — bse) + bı (Cods — cad2) + cı (bada — bodds)] _ [bs ca| " |ds cs| ° |ds bs 
cılaı(b2c3 — b3c2) + by(c2a3 — c3a2) + cı (baz — b2a3)] by c a © “a b 
ay = by + Cj) 
b3 c3 a3 C3 a} b3 


As a reminder, we’ll deal with the case in which cı = 0 in Problem 11.18. Oo 


We use the unwieldy expressions in the numerator and denominator of our result for x above to define the 
determinant of a 3 x 3 matrix. 


Definition: We define the determinant of a 3 x 3 matrix as follows: 


Cc a> b 
Leg ee 
C3 a3 b3 


= ab2c3 + byc2a3 + C1 a2b3 — 44C2b3 — bya2¢3 — c1b243. 


We can also denote the determinant of a matrix A as det(A). 


ay by Cy 
Letting A = (> bz 3 , we can write the original system in Problem 11.14 as 
az b3 c3 


d b a d bh a 
dy b c dy b c 
d3 b3 c3 d3 b3 c3 
x= — TW = 
2 C2| a2 C2 UD) bz det(A) 
i b Cs by 43 C3 g. 43 b3 


` 


This gives us an expression for x that is very much like the expression we found in Cramer’s rule for systems of 
two linear equations with two variables. Indeed, we can go through essentially the same steps to show that if 
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det(A) # 0, then the system has the solution 


dy by C1 ay dy Cy ay bi dy 

do bo C2 A2 dz C2 ag bo do 

dz; b3 c} _ |B dz c3 _ |8 b3 dz 
det(A) ” Y= Get(A) ' ~ det(A) 


We call this Cramer’s rule for a three-variable system of three linear equations. 


Cramer’s rule is just one application of determinants of 3 x 3 matrices. You'll encounter applications of 
determinants in various areas of mathematics, physics, economics, and engineering. 


Looking back at our solution to Problem 11.14, we see one particularly useful application of the determinant. 
Specifically, our work there, together with corresponding work solving for y and z, shows that the system 


ayX + biy +¢4Z =d; 
AX + boy +z = dz, 
A3X + bsy + ¢3Z = d3, 


has a unique solution for x if 


m b a 
a2 b> Co| # 0 
a3, b3 c3 


In other words, if the determinant of the matrix given by the coefficients of the variables in the system is nonzero, 
then the system has a unique solution. 


Important: For any matrix A and any vector c, if det(A) # 0, then there is a unique vector 
Vv v such that Av = c. i 


e 


Just as with 2 x 2 matrices, we say that a 3 x 3 matrix is singular if its determinant is 0 and nonsingular if its 
determinant is nonzero. Therefore, the system Av = c has a unique solution if A is nonsingular. 


As you'll see in Section 12.5, we can use this observation as a starting point to determine whether or not a 
given 3 x3 matrix has an inverse. But we’ll need a few more important concepts before we can tackle that, so now 
we'll get more accustomed to the determinant by exploring some of its properties. 


When you're first getting started with the determinants of 3 x 3 matrices, it can be pretty 4 pa ada 
hard to remember how to compute them. One handy way of remembering the expression bh > l Ds h "b 
for the determinant of a 3 x 3 matrix is shown at right. We copy the first two columns of e esx cow 
the matrix after the third column as shown. We add the products of three numbers along 
the three northwest-to-southeast diagonals (the diagonals along solid gray lines in the diagram) and subtract the 
products of three numbers along the three northeast-to-southwest diagonals (those along dashed gray lines in the 
diagram). This is sometimes referred to as the Rule of Sarrus. 


Problems 


Problem 11.15: Evaluate each of the following: 
go. 0 
7 =2 
=o A 


397 


CHAPTER 11. VECTORS AND MATRICES IN THREE DIMENSIONS, PART 1 


ay by Cy 
Problem 11.16: Let A = (z bz 3 . The transpose of a A, which is denoted as A’, is defined as follows: 


az b3 c3 
a, a2 a3 
A’=([b, by b |. 
(| (Gp (ee 


We can think of the transpose as the result of “flipping” the matrix over the diagonal from upper left to lower 
right of the matrix. Show that det(A) = det(A’). 


Problem 11.17: 
(a) Suppose each entry in one row of A is multiplied by some constant k. How is the determinant of the new 
matrix related to det(A)? 
(b) Suppose each entry in one column of A is multiplied by some constant k. How is the determinant of the 
new matrix related to det(A)? 


(c) How is det(kA) related to det(A), where A is a 3 X 3 matrix? 


Problem 11.18: 
(a) Suppose B is the matrix formed when we swap the first and second rows of A. How are det(A) and 
det(B) related? 


(b) Patch the hole in our solution to Problem 11.14 by using part (a) to address the case in which c = 0. 


Problem 11.19: 
(a) Show that if one row of A is a constant times another row, then det(A) = 0. 
(b) If det(A) = 0, then must either one row of A be a constant times another row or one column be a constant 
times another column? 


Problem 11.20: 
ay 
(a) Show that |a2 
a3 


ay 
(b) Show that |a2 
a3 


Problem 11.21: 
1 


3 4 -3 2 -1 
(a) Howare|2 -1 2|+|2 -1 2ļand|2 -1 2) related? 
Sam 3 5S 0 3 5 0 3 


(b) Notice that the first row of the final matrix in part (a) is the sum of the first rows of the other two matrices, 
and that the last two rows of all three matrices are the same. Does the relationship you found in part (a) 
generalize? That is, must we always have 


a4 bh a ai ci a, +t ay bi + bi G ci 


a b Col + |a? 
a3 bs C3 a3 


Must we have det(A) + det(B) = det(A + B) for any matrices A and B? 
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> Pees i SD , ea 
Problem 11.22: Let A= | -1 4 2 | andletB={ 2 13 17 |, where B is the matrix formed by adding 3 
i Se 0 E -3 0 6 
times the first row of A to the second row of A and leaving the other two rows unchanged. How are det(A) and 
det(B) related? Can you generalize this observation? 


Problem 11.23: Evaluate each of the following: 
13 -26 39 2009 10050 —2010 


a (b) |328 1640 -328 
A Epa -602 -2990 598 


We start with a little practice evaluating determinants of 3 x 3 matrices. 


Problem 11.15: Evaluate each of the following: 


= 
So Z 
=) 2 


Solution for Problem 11.15: 


(a) We have 
4 -1 0 
7 -2 5 ç =2 oer 
sie -2|= 4 -vl sol” 4 
3 2 1 -2 1 o Íl 3 =? 
= 4((7)(1) — (—2)(—2)} + 1(6)() — (-2)(-3)) + 0 = 43) + 1-1) = 11. 
(b) We have 
Ge al 
l 5 4 -5 4 1 
4 1 -5 =5| |-3|4 renji | 
32 6 2 6 2 6 om2 
= 5((1)(6) — (2)(-5)) — 3((4)(6) - (—5)(-3)) + (-1)(4)2) - 1)(-3)) 
= 5(16) — 3(9) + (-1)(11) = 42. 
o 


ay by Cy 
Problem 11.16: Let A = (> bz a) . The transpose of a A, which is denoted as A'T, is defined as follows: 


a3 b3 c3 
ay A2 A3 
Al = b bo b}. 
CG C2 C3 


We can think of the transpose as the result of “flipping” the matrix over the diagonal from upper left to lower | 
right of the matrix. Show that det(A) = det(A”). : 
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eS ane 


Solution for Problem 11.16: We have 


det(A) = a1b2c3 + byc2a3 + c1a2b3 — a1C2b3 — bya2c3 — c1b243 


and 
a M2 a3 
det( A7) = |b, b} b3| = ayboc3 + az2b3c1 + azbıc2 = a1b3c2 — a2b1C3 — a3b2c1 
C1 C2 C3 


= a1b2C3 aF c142b3 + bic243 = A4C7b3 = bya2C3 = €1b203 
= aybec3 + bic243 + C1azb3 = A\C2b3 = bya2c3 = €1b243, 


so det(A) = det(A’). o 


Important: For any matrix A, we have det(A’) = det(A), where A’ is the transpose of A. 


am NO a a L aaa E S 


We’ll see why this fact is useful in the next problem. 


Problem 11.17: 
(a) Suppose each entry in one row of A is multiplied by some constant k. How is the determinant of the 
new matrix related to det(A)? 


(b) Suppose each entry in one column of A is multiplied by some constant k. How is the determinant of the 
new matrix related to det(A)? 


How is det(kA) related to det(A), where A is a 3 x 3 matrix? 


(c) 


Solution for Problem 11.17: 
(a) We have 


kaı kb, kcı 
A2 bz C2 
a3 b3 C3 


= kayboc3 + kbyc2a3 + kc a2b3 — kaycob3 — kbyarc3 — key b2a3 


= k(aybec3 + byc2a3 + c142b3 = a1C2b3 = byarc3 z= cıb24a3) 


a b ca 
=K ad bo C2 
a3 b3 c3 


So, multiplying the first row of a matrix by k multiplies the determinant by k. Similarly, if we multiply any 
row or any column by k, we multiply each term in the determinant by k, thereby multiplying the whole 
determinant by k. 


(b) Rather than going through all the computations, we can use part (a) together with what we just learned 
about the determinant of the transpose of a matrix. Suppose we multiply the first column of A by k. Since 
the determinant of a matrix equals the determinant of its transpose, we have 


kay by Cy kay kaz kag ay by C1 
det | kaz bz cp ]=det| bi b b, | =kdet| a bz co | =kdet(A7) = kdet(A), 
kaz b3 C3 Cy C2 C3 a3 b3 C3 


where we used part (a) to “factor” k out of the first row. We can go through similar steps if any other 
column is multiplied by k to show that multiplying any column of a matrix multiplies its determinant by k. 
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(c) Multiplying a row by k multiplies the determinant by k. To produce kA, we multiply each of the three rows 
by k, thereby multiplying the determinant by k three times. So, we have det(kA) = ke det(A). 


(m) 


Important: If we produce matrix B by multiplying a row or a column of A by some constant 


Y k, then we have det(B) = kdet(A). Therefore, for any 3 x 3 matrix A, we have. 
det(kA) = K det(A). 


eo oF a a I a 


Our solution to part (b) shows how useful the fact that det(A) = det(A?) can be. 


‘Concept Because det(A) = det(A’), for anything we prove regarding the rows and the 


Q= determinant of a matrix, we can prove a corresponding fact about the columns 


| and the determinant of a matrix. < 


| Problem 11.18: 
| (a) Suppose B is the matrix formed when we swap the first and second rows of A. How are det(A) and 
det(B) related? 


(b) Patch the hole in our solution to Problem 11.14 by using part (a) to address the case in which c; = 0. 


Solution for Problem 11.18: 


(a) Wehave 
ma b a 
ad bo Co] = A1 boc3 + bi c243 + C1 a2b3 = a4C2b3 = by a2C3 = (6) boA3, 
az b3 c3 
m b C2 
ay by (| = anbic3 T boc 43 + CA1 b3 — A2C1 bs = boaiC3 = c2b143 
a3 b3 c3 


= bia2c3 + cıb243 + a,Cob3 = C1a2b3 = ayboc3 = byC243. 


We see that the terms in the second determinant are the opposites of those in the first determinant. (We 


have abzc3 in first and —a,b2c3 in the second, and so on.) Therefore, the second determinant is the negative 
of the first. 


We can go through essentially the same manipulations as above to show the following: 


Important: If we produce matrix B by swapping any two rows or any two columns of A, 
VV then det(B) = — det(A). 


i I 


i 


a3 b3 C3 
then the system of equations 


ay by Cj 
(b) LetA= (> bo 3 . In Problem 11.14 and the discussion thereafter, we showed that if det(A) and cı # 0, 


ax + biy +cz = di, 
mx + boy + Coz = do, 
a3x + b3y + c3Z = d3, 
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has the unique solution 


d b cy a dy Cy ay by dı 
dy br C2 ay dy Cp ay b d 
dz b3 c3 a3 d3 c3 az b3 ds 
= Ce Z = LI ER TO 
det(A) ” I= ~“Get(A) det(A) 


To deal with the case in which cı = 0, we note that if cı = 0, then either cz or c3 must be nonzero, since 
otherwise we would have det(A) = 0 (because there is a factor of the form c; in each term of the expansion 
of det(A)). 


Suppose c2 + 0. Then, we can view our system of equations as 


ax + boy + coz = do, 
ax + biy +z = di, 
a3X + b3y +¢3Z = d3, 


where all we have done is swapped the first and second equations. Then, our work in Problem 11.14 tells 


us that 
d bo c 
dı by C1 
d3 b3 c3 
4 ay b c| 
a b c 
az b3 c3 


Applying the result of part (a), we swap the first and second rows of each determinant and have 


dy bo c2 ad, b c d b c 

d b c -|d by & dy by c 

d3 b3 C3 d3 b3 C3 d3 b3 C3 
* ao bz C2 a ay by Cy p det(A) Í 

a b c — |a bo © 

az b3 c3 az b3 c3 


as desired. We can do essentially the same for y and z, so Cramer’s rule holds if det(A) + 0, cı = 0, and 
C2 +0. 

Finally, we can go through essentially the same steps as above if c1 = cp = 0 and c3 + 0, by swapping 
the first and third equations in the original system of equations. We have therefore patched the hole in our 
solution to Problem 11.14 and shown that the system has the unique solution given above even if c = 0. 


Problem 11.19: 
- (a) Show that if one row of A is a constant times another row, then det(A) = 0. 


(b) Ifdet(A) = 0, then must either one row of A be a constant times another row or one column be a constant 
times another column? 


Solution for Problem 11.19: 
(a) Ifthe third row is k times the second row, then we have 


ay bı C1 
a2 bo C2 | = 41 
kay kba kc 


bz C2 ag C2 
kb) kcz 


7% kaz kcz 
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In each of the 2 x 2 determinants, one row is a constant times the other, so all three of these determinants 
are 0, which means that the determinant of the 3 x 3 matrix is zero. 


If the first row is a constant times the second row, then we can swap the first and third rows and use the 
proof above to find that the determinant of the resulting matrix is 0. The swapping of rows only reverses 
the sign of the determinant, so the original matrix has determinant 0 as well. Similarly, if the first row is a 
constant times the third row, we can swap the first and second rows and use a proof similar to that above 
to show that the determinant of the original matrix is 0. 


Important: If one row of a 3x3 matrix isa constant times another row, or one column of the 


Y matrix is a constant times another column, then the determinant of the matrix 
is 0. 


No! Here’s a counterexample: 


(b) 
lO 2 
P ; : = (DG) + (0)(1)(1) + (2)(0)(1) -OA — (0)(0)(3) - (20)0) = 0. 
No row is a constant times another and no column is a constant times another, and yet the determinant is 
still 0. We'll learn what’s going on here in the next chapter. 
T 
Problem 11.20: 


ay 

a) Show that jaz b c|=-—a =C | 

(a) ‘ *\b3 c3 a3 c| 7 \a3 bs 
a3 b3 c3 
a b c 


(b) Show that |a2 


Solution for Problem 11.20: 


(a) The desired expression on the right side looks a lot like our definition of the determinant, except each term 


on the right has an entry in the second row times a 2 x 2 determinant, rather than an entry in the first row 
times a 2 x 2 determinant. By swapping the first two rows in the 3 x 3 matrix, we produce a matrix with 
determinant —1 times the original matrix. The new matrix has a, bz, and c2 in the first row, which gives us 
the desired right side when we express the determinant with our definition: 


a b c a2 b c 
1 1 1 2 2 2 by Cy t c ay by 
a2 bo Co| = — |ay bi C| = — |2 —b + C2 
b3 C3 a3 C a3 b3 
az b3 c3 az bz c3 
bi c a a, b 
Sea OR Coa 
b3 C3 az C a3 bz 


as desired. 


(b) We wrote the expression for a 3 x 3 determinant in terms of 2 x 2 determinants by selectively factoring out 


first row terms when writing the expression 


ay by Cy 
a bz c| = a4(b2c3 — c2b3) — by (arc3 — C243) + C1 (a2b3 — bza3) 
a3 b3 C3 


emer 
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If we instead expanded all these products and grouped the resulting terms by the third column entry that 
appears in each term, we have 


ay by C1 
a b} c| = a1b2c3 — A1C2b3 — biazc3 + bic2a3 + c1a2b3 — C1 b2a3 
a3 b3 c3 


= c (ab — bza3) — c2(a1b3 — bya3) + c3(a1b2 — b102) 


a by 
a3 b3 


m b 


a b 
„o A 1 
a3 bs 


=C Tarte 
1 2 ° Jaz bz 


(We could also have solved part (a) similarly, by grouping terms with a, terms with bz, and terms with c2.) 
Oo 


Our initial definition of the determinant of a 3 x 3 matrix and the two equivalent expressions we found in the 
previous problem are examples of a process called expansion by minors. A minor of a matrix A, denoted as Ajj, 
is the determinant of the matrix that remains when row i and column j are omitted from matrix A. So, if 


A141 412 413 
A= | an 422 43 |, 
431 432 433 


then we find Ay) by omitting row 1 and column 2, and then taking the determinant of the remaining 2 x 2 matrix: 


Az, ab a3 = Ay = 
431 2 43 


Similarly, we find A3; by omitting row 3 and column 1, and taking the determinant of the remaining 2 x 2 
matrix: 


=> A31 = 


With this notation, we can express our definition of the determinant as follows: 


Ay, A12 413 


; 422 A23 A21 423 A22 423 
det | 421 422 423 | =an — M2 
A32 433 A31 433 432 433 
431 432 433 
= a1 Aq1 — 412412 + 413413. (11.5) 


Our work in part (a) above shows that we can “expand along the second row,” too: 


441 A12 433 ep aun ee a 
13 
det | an 422 3 | = —ay — 473 
432 433 A31 A33 A32 33 
A31 432 433 
= -41 An + anÁn — a73A93. (11.6) 


By “expand along the second row,” we mean that each term in our evaluation of the determinant is the product 
of an entry in the second row, which is az; for j = 1, 2, or 3, and the corresponding minor A2;. As we have seen, 
sometimes these terms are added and sometimes they are subtracted. There’s a pattern to which are added and 
which are subtracted: we add when the sum of the subscripts of the minor is even, and we subtract when that 
sum is odd. 
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Noting that (D*+ is -1 if i + j is odd and 1 if i + j is even, we can express the “expansion along the third 
column” in part (b) as 


441 An 43 z a z > F ay 
21 422 11 4412 11 2 
det | 421 an 423 | = a3 — a3 + 433 
431 32 431 432 421 422 
431 432 N33 
pe 1+3 2+3 3+3 
= (—1) "a313 + (—1) 7 a23A23 + (—1)" a33433. (11.7) 


That we can expand along any row as described above can be expressed as 
det(A) = (-1)+anAn + (-1)a2Az + (-1)agAis 


for i = 1,2, or 3. (The result for i = 1 is line (11.5) and the result for i = 2 is line (11.6).) Similarly, we can express 
the fact that we can expand along any column in this way as 


det(A) = (-1)'/ayjAq; + (-1)?"/a2jAo; + (-1)?/a3jA3j 
for j = 1, 2, or 3. The result for j = 3 is line (11.7). 


Sidenote: Some sources use cofactors of a matrix rather than minors. A cofactor of a matrix 
N for row i and column j of a matrix A is (-1)'*/Aj;, where Ajj is a minor of A. In 
other words, a cofactor is formed by multiplying a minor by 1 or —1, depending 

on the sign of that minor in expansion by minors. | 


Writing out all six products in the determinant makes it clear why we can expand by minors along any row or 
column. We start with 


Ja b a 
a2 b> C2) = a, b2c3 ae byc2a3 oP C142b3 = A4C2b3 = by 4203 =C] b2a3, 
a3 b3 c3 


and note that each term is of the form a;b;cķ, where 1, j, k are 1, 2, 3 in some order. So, each term has one factor 
corresponding to each column (the letters) and one factor corresponding to each row (the numbers). We can group 
by any number to produce an expansion along a row, such as 


aig vine bi c ay Ay 1 
m b cj = a(b3c1 — byc3) + bo(aic3 — a3c1) + C2(a3b1 — a1b3) = —a2 ye elias bz 3 e pale 
ee 3 C3 3 b 


or we can group by any letter to produce an expansion along a column, as we did in part (b) of Problem 11.20. 


Problem 11.21: 
3 4 -3 2 -1 -2 5 3 


(a) Howare|2 -1 2ļ|+|2 -1 2jand|2 -1 2]j|related? 
5 0 3 5 0 $ 5ST’ 
(b) Notice that the first row of the final matrix in part (a) is the sum of the first rows of the other two f 
matrices, and that the last two rows of all three matrices are the same. Does the relationship you found 
in part (a) generalize? That is, must we always have 


7 A / t $ 


Must we have det(A) + det(B) = det(A + B) for any matrices A and B? 
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Solution for Problem 11.21: 


(a) We have 
1 3 4 
: -1 2) = (1)(—1)(3) + (3)(2)(5) + (4)(2)O) — (4)(2)(0) — (8)(2)(3) - (4)(-1)) = 29, 
0 3 
-3 2 -i 
2 -1 2 = (-3)(-1)(3) + (2)(2)(5) + (—1)(2)(0) — (-3)(2)(0) — (2)(2)(3) — (-1)(-1)@) = 12, 
oe 0 3 
-2 5 3 
2 -1 2} = (—2)(-1)(3) + (5)(2)(S) + (3)(2)(0) — (—2)(2)(0) — (5)(2)(3) — B)(-1)(S) = 41, 
SEOS : 


and we see that the sum of the first two equals the third. 


(b) Expanding each determinant in the sum along the first row gives: 
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u / i 
z bo C2} + |a2 bz Co| = ay — bı + Cj 
b b 
3 C3 43° C a3 
az b3 c| |a3 b3 c3 
bo c a> C a> b 
+a 2 2 x bi 2 2 F ci 2 r 
b3 c3 a3 C3 a3 b3 
bo c a> © 2 bo 
=(4+a')| 7]-(@ +01) |? Cite, 
b3 C3 43° C3 a b3 
ay +a, by +b} Ci G 
=| @ by C2 
a3 bz C3 


There’s nothing special about the first row. For example, suppose two matrices have the same first and 
third rows. The sum of the determinants of these matrices equals the determinant of the matrix with those 
same first and third rows and with second row equal to the sum of the second rows of the original two 
matrices: 


a b c a4 b cy ay by Cy 
az by colt+|a, bf c&|= az +a, b2+b, c2+c3l. 
az b3 c| |a3 b3 c3 a3 bs C3 


There’s nothing special about rows, either. If two columns in one matrix are the same as the corresponding 
two columns in the other, then the sum of the determinants of these matrices equals the determinant of the 
matrix with those same two corresponding columns and with the other column equal to the sum of the two 
differing columns in the original two matrices. For example, we have 


a b caj la b c 
ag bz C2| + 142 by ch 
a3 b3 C3 a3 b3 C3 


a, by ct ci 
=|a42 b c +c, : 
a3 b3 C3 + C3 


No. We can find a counterexample quickly by letting A = B = I. Then, we have det(A) = det(B) = det(I) = 1, 
so det(A) + det(B) = 2, but 


2 00 
det(A+B)=|0 2 0) =2?=8. 
0 0 2 


So, det(A) + det(B) and det(A + B) are not necessarily the same. O 


11.3. DETERMINANTS OF 3 x 3 MATRICES 


WARNING!! Jf A and Bare 3x3 matrices, then det(A) + det(B) and det(A + B) are not | 
a necessarily the same. 


With a great deal of perseverance, we can show that det(A) det(B) = det(AB) for any two 3 x 3 matrices A 
and B. 


Important: For any two 3 x 3 matrices A and B we have det(A) det(B) = det(AB). 


Y 


| a ies il eens ie) | 
| Problem 11.22: Let A = (A 4 2]andlettB=( 2 13 i7 , Where B is the matrix formed by adding $ 
| -3 0 6 = 0 ©; 
| 3 times the first row of A to the second row of A and leaving the other two rows unchanged. How are det(A) f 
and det(B) related? Can you generalize this observation? 


Solution for Problem 11.22: We have 


det(A) = (1)(4)(6) + (3)(2)(-3) + (5X(-1)(0) — (1)(2)(0) — (3)(-1)6) — (5)(4)(-3) = 84 


and 
det(B) = (1)(13)(6) + (3)(17)(—3) + (5)(2)(0) — (1)(17)(0) — (3)(2)(6) — (5)(13)(—3) = 84, 
so det(A) = det(B). 


There doesn’t appear to be anything special about A or B. So we suspect that, in general, if we add a constant 
times one row to another row of a matrix, then the resulting matrix will have the same determinant as the original 
matrix. 


Rather than starting from the definition of the determinant to generalize this observation, we instead combine 
properties of the determinant that we discovered in earlier problems. Suppose that T is formed by adding k times 
the first row of M to the second row of M. From the property we discovered in Problem 11.21, we can write det(T) 
as: 


mıı mMı2 m13 Miu Miz 1143 Mii Mı? Mis 
det(T) = |M + km mz + kmız m23 + km43 = {127 M22 M3) + km km km43 7 
1113) 11132 133 ™3, M32 M33 m31 m32 33 


The first determinant on the right end is just det(M). The second is the determinant of a matrix in which one row 
is a constant times another. We showed in Problem 11.19 that the determinant of such a matrix is 0, so we have 
det(T) = det(M). O 


By working through essentially the same steps as we did above, we can show the following: 


Important: For any matrix M, the determinant of a matrix formed when we add a constant 
times any row of M to another row of M equals det(M). Similarly, the deter- 
minant of a matrix formed when we add a constant times any column of M to 

another column of M equals det(M). 


We usually have to grind out a bunch of multiplications to evaluate the determinant of a 3 x 3 matrix, which is 
why we typically use a computer to evaluate them. But sometimes we can use the properties we have discovered 
in this section to evaluate a determinant pretty quickly. 
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Problem 11.23: Evaluate each of the following: 
iS) =26" 7 39 2009 10050 -2010 


(aj |S 32 2B (b) |328 1640 -328 
4 16 -12 —602 -2990 598 


Solution for Problem 11.23: 


(a) Having Os in a matrix makes finding its determinant easier since some of the products in the expression for 
the determinant will be 0. So, we subtract twice the third row from the second. We also can factor 13 out of 
the first row and 4 out of the last row, so we have: 


13 -26 39 ld =e Îi 2 3 l = á ə 
e o eA ASO 0 1 |=13|J0 0 1) = 13740 0 1 
4 16 -12 4 16 -12 aA o = i a = 


= 52((-2)(1)(1) - (1)(1)(4)) = —312. 


(b) We factor 10 out of the second column, and add the first column to the third to give us a 0. This gives 


2009 10050 -2010 2009 1005 -2010 2009 1005 -1 
328 1640 -328 | = 10| 328 164 -328 |= 10| 328 164 0 
—602 -2990 598 —602 -299 598 H A 


Subtracting twice the middle column from the first will give us another 0: 


2009 1005 = = ive = 
10/328 164 O|=10/0 164 0 
602 -299 -4| |-4 -299 -4 


Aha! The first and last columns are the same, so the determinant equals 0. 


= ~ 


11.3.1 Compute the following two determinants: 


1 5 0 3 4 05 
Err) 6 (b) 0.25 -6 -1 
m 3 o 2 


11.3.2 Show that (A+B)? = A? +B’. 
11.3.3 Show that if A and B have the same second and third columns, then det(A) + det(B) = det(A + B). 


30 51 20 
11.3.4 Evaluate | 15 24 10 | without a calculator or computer. 
-46 28 -29 


11.3.5 An upper triangular matrix is a matrix for which all the entries below the main diagonal are 0. That is, it 
is a matrix of the form 
4i An 43 
0 an a3 
0 0 A33 


What is the determinant of such a matrix? 
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11.4 More Than Just 2 x 2 and 3 x 3 


We introduced 2 x 2 and 3 x 3 matrices with systems of equations in which the number of variables equals the 
number of equations. But what if the number of equations does not equal the number of variables? For example, 
can we use matrices and vectors to express this system: 


UI OY = 22 — 4, 
—20 +x- y + oz =b? 


Yes, we can: 


Ne eR 
Il 
ES 
w a 
Se 


On the left side, we have a couple of mathematical objects we haven’t seen before, a matrix with 2 rows and 4 
columns, which we call a 2 x 4 matrix, and a vector with 4 components. The product of the 2 x 4 matrix and the 
vector in 4 dimensions gives us a vector in 2 dimensions. In other words, the 2 x 4 matrix represents a linear 
function that maps a vector in 4 dimensions to a vector in 2 dimensions. 


In general, we can make a matrix with any positive integer number of rows and any positive integer number of 
columns, and a vector can have any positive integer number of components. In fact, the vectors we have studied 
so far can be viewed as a matrix with only one column. We call such a vector a column vector, and a matrix 
with only one row can be considered a row vector. So, for example, each of the c columns of an r x c matrix (a 
matrix with r rows and c columns) is itself a vector with r components, while each of the r rows is a vector with c 
components. 


Collectively, we call the number of rows and the number of columns of a matrix the dimensions of a matrix. 
We read the dimensions r x c as “r by c”. 


As we have seen, we can use the product of an r x c matrix and a vector with c components to represent the 
expressions on one side of a system of r linear equations with c variables. In general, we define the product of an 
r xc matrix and a vector in c dimensions analogously to how we defined the product of a 2 x 2 matrix and a vector 
in 2 dimensions: 


m m i vi 114101 + 14202 + M1303 + °° + M1icVe 
m eee 
i aie iy fa U2 MAVI + M2U2 + M1303 + +++ + McUe 
m m eee 

a E = a v3 = M3101 + 3202 + 13303 + - ++ + M3cVe 
m Mp m3 > m i 

a Va i Ü; MnV1 + M,202 + M303 + +++ + MycVe 


Note again that the product of an r x c matrix and vector in c dimensions is a vector in r dimensions. So, an r X c 
matrix represents a linear function that maps a vector in c dimensions to a vector in r dimensions. 


Important: The product of an r x c matrix and a vector in n dimensions is defined if and 


WW only ifc =n. 


We define the addition of two vectors in n dimensions and scalar multiplication of vectors in n dimensions just 
as we did in 2 and 3 dimensions: 


vi wi vı + W V1 kv, 

V2 W2 V2 + We U2 kvz 
v+w= _ |+ = i : kv=k| . = 

Vn Wy Un + Wn Un kün 
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Note that we cannot add two vectors that have differing numbers of components. 


The dot product and the norm of vectors in n dimensions are defined just as they are in 2 and 3 dimensions: 


‘Important: For any vectors v and w in n dimensions, we have ! | 
VW = 0104 + UW +++ + UnWy, 


livi! = v a v3 AE oe 


|... Problems [ig 


Problem 11.24: In this problem, we explore matrix addition. 


(a) We can define the sum of two vectors only if the two vectors have the same number of components. 
Show that Av + Bv is defined if and only if A and B have the same dimensions. 


(b) Let Aand B ber x c matrices. We'd like to define the matrix sum A + B such that (A + B)v = Av + Bv for 
all v. Show that such a matrix A + B exists and explain how to compute it. 


Problem 11.25: We define the matrix kB such that (kB)v = k(Bv) for all v. Show that each entry in kB is k times 
the corresponding entry in B. 


Problem 11.26: Let A be an r x s matrix and B be a q x c matrix. Suppose there are vectors v and w such that 
v = A(Bw). 

(a) How many components must w have? 
(b) Show thats = q. 


(c) How many components must v have? 


Problem 11.27: We define the product of an r x s matrix A and ans xX € matrix B to be the matrix AB such that 
(AB)v = A(Bv) for every vector v in c dimensions. 
_ (a) What are the dimensions of the matrix that equals AB? 

(b) We have seen that if M and N are 3 x 3 matrices, then the entry in row i and column j of the product MN 
is the dot product of row i of M and column j of N. Can we use the same process to find the product AB? 


Problem 11.28: Compute the following products: 
-3 §8\. 
-1 0 
=} Ẹ 


| Problem 11.24; In this problem, we explore matrix addition. 


| (a) We can define the sum of two vectors only if the two vectors have the same number of components. 
| Show that Av + Bv is defined if and only if A and B have the same dimensions. 


(b) Let A and B ber x c matrices. We’d like to define the matrix sum A + B such that (A + B)v = Av + Bv 
for all v. Show that such a matrix A + B exists and explain how to compute it. 
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Solution for Problem 11.24: 


(a) Suppose that v has c components. In order to be able to define the product Av, the matrix A must have c 
columns. Similarly, B must have c columns. 


Next, let A have r rows, so the product Av is a vector with r components. In order to be able to perform 
the vector addition Av + Bv, the vectors Av and Bv must have the same number of components. Therefore, 
Bv must have r components, which means that B must have r rows. Therefore, B must be an r x c matrix, 
just like A. 


(b) Since we have 


v1 al 
My) 42 M3 “° Ak by bu bi ++: bie 
U2 U2 
fy) p27 ë A23 e Ax ba bn b +- bx a 
Av + Bv = : ; ; , 3 U3 ay i , 3 
An An Ay *** Ar by ba ba =- be 
Oc Oc 
A1101 + A1202 + 41303 + +++ + A1cUe byv + by202 + 51303 +--+ + bicte 


47101 + 427202 + A23V3 + +++ + AXU bav aF ba2v2 + bn303 ap eed op boeV¢e 
F 


Ar1V1 + AyQU2 + A303 + +++ + ArcUe bavi aP b,2V2 + b303 qp eee ap brcUe 


(a11 + by1)v1 + (an2 + by2)02 + (a13 + b13)03 + +++ + (age + Dic)Ue 
(421 + b21)v1 + (a22 + b22)v2 + (a23 + b73)03 + +++ + (Are + bzc)Ue 


(anı + by yv ar (42 + by2)v2 ar (4:3 + b,3)03 uC Ge (Arc ar brc)Ue 


al 

4ay+by aotby a+b > act Di a 

azn +b, az +b az +b +++ Ane + bx n 
= . . f 

an +ba an + by ar +b are + bre > 

c 

we can define the matrix sum A + B as 
ay tbu an +b a3+bi3 + act bie 
az + b2 A22 + bon 93 +23 +++ ax + bx 
A+B= i : 

an +tba a@atbe a3t+b3 +: Amt Dre 


Important: | Two matrices can be added if and only if they have the same dimensions. Each 
Y entry in the sum is the sum of the corresponding entries in the two matrices. 


So, addition works in the “obvious” way. How about multiplication of a matrix by a scalar? 


Problem 11.25: We define the matrix equal to kB such that (kB)v = k(Bv) for all v. Show that each entry in kB 


is k times the corresponding entry in B. 
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Solution for Problem 11.25: We have 


[ bu biz bis = bie A 101 + 4202 + 61303 +-+- + Dicd¢ 
(kB)v = k(Bv) =k : : a ee . =k : 
\ by b,2 b,3 Aye Dre by vı + b,2V2 SP b,303 gr ooo ap OTUR 
c 


kby104 oP kby202 + kb1303 qp eo oop kbicve 


kbavi + kbyv2 + kb,3v3 +--+ + kb,cve 


kb kbi2 kbi + kbi i 
Abn kb, kb, ia kb yc a 
kb kb kb +++ kbi 
S| E 
kb; kb,2 kb,3 i kbye 


So, each entry in kB is k times the corresponding entry in B. 


re A ER R e NA 


Important: We multiply a matrix by a scalar by multiplying each entry in the matrix by the 


Vv scalar. 


Oo 


So, addition of two r x c matrices works in the expected way, as does multiplication of any matrix by a scalar. 
In the following series of problems, we explore general matrix multiplication. 
Problem 11.26: Let A be an r x s matrix and B be a q x c matrix. Suppose there are vectors v and w such that 
v = A(Bw). 
(a) How many components must w have? 


(b) Show thats = q. 


(c) How many components must v have? 


Solution for Problem 11.26: 


(a) The product Bw is defined if and only if the number of columns in B equals the number of components in 
w, SO w must have c components. 


(b) Because B is a q X c matrix, the product Bw must be a vector in q dimensions. The product A(Bw) is defined 
if and only if the number of columns in A matches the number of components in the vector Bw. Therefore, 
we must haves = q. 


(c) As explained in part (b), the vector Bw is a vector ins dimensions. Since A is an r xs matrix, it maps vectors 
in s dimensions to vectors in r dimensions. Therefore, the product A(Bw) is a vector with r components. 


As an example, suppose A is a 4 x 2 matrix and B is a 2 x 3 matrix. If vectors v and w satisfy v= A(Bw), then 
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11.4. MORE THAN JUST 2 x 2 AND 3 x3 


v has 4 components and w has 3 components, and the equation v = A(Bw) looks like: 


V1 Aji M2 

wı 
v2 \ _ | an an ts by2 a 
U3 A31 A32 ba bu bz its 
U4 44, A42 


Problem 11.27: We define the product of an r x s matrix A and an s x c matrix B to be the matrix AB such that 
(AB)v = A(Bv) for every vector v in c dimensions. 


(a) What are the dimensions of the matrix that equals AB? 


(b) We have seen that if M and N are 3 x3 matrices, then the entry in row i and column j of the product MN 


is the dot product of row i of M and column j of N. Can we use the same process to find the product 
AB? 


Solution for Problem 11.27: 


(a) Let T = AB. The product Tv is defined only if the number of columns in T equals the number of components 
of v. Therefore, T has c columns. 


The number of components in the product Tv is the number of rows in T. Since Tv = A(Bv), the number 
of rows in T equals the number of components in A(Bv). The matrix A maps vectors in s dimensions to 
vectors in r dimensions, so the result of A(Bv) is a vector in r dimensions. Therefore, T has r rows, which 
means T is an r x c matrix. 


į 


Important: The product AB is defined if and only if the number of columns in A equals the 
W number of rows in B. If A is an r x s matrix and B is an s x c matrix, then the 
| product AB is an r X c matrix. 


(b) We discover how to multiply two matrices (when such a product is defined) in the same way we discovered 
the process for multiplying two 3 x 3 matrices. Suppose A is a r X s matrix and B is a s x c matrix and that 
(AB)v = A(Bv) for all vectors v in c dimensions. Let the rows of A be aj, a2, . . ., a, and the columns of B be 
bı, b2, ..., be. Then, we have 


Vi buvi SF by2v2 SF bi3v3 qeeeage bicUe 
| | | U2 bav ate bauz + bo303 +*+ baeV¢ 
(AB)v = A(Bv) =A bı bo © be ; =A 
[ | | ; : 
Uc bti + byv + D303 +--+ + DycVe 


= A(vıbı + v2b2 +--+- + ube) 
= v Abı + v2Ab2 +--+ v-Ab, 


V1 
| | | U2 
=| Ab, Ab, - Ab, ; 
| | | 
Uc 
| | | 
| | | 
Therefore, we have 
| | | 
AB= | Ab, Ab: Ab, 


413 


CHAPTER 11. VECTORS AND MATRICES IN THREE DIMENSIONS, PART 1 


Since for each i, we have 


ay ai +b; 

a2 a2 - b; 
Ab; = $ b; = , 

ar a; k b; 


we have 


Problem 11.28: Compute the following products: 
l 


= el 


(a) & z 4 ae 7 


= 8 


Solution for Problem 11.28: 


(a) The product of a 2 x 3 matrix and a 3 x 3 matrix is a 2 x 3 matrix: 


4 -3 8 
5 -1 Q 
3 (529) 
_ ( (5)(4) + (-1)) + (0)(-2)  X-3) + (-1)(-1) + (0)(-2)  (5)X(8) + (—1)(0) + (0) (8) 


5 (Sa + (8)(5) + (-2)X(-2) (=3)(-3) + (8X(-1) + (-2)(-2) 3,8) + BO) CND) 


k 6 =14 a 
~\32 5 -40 


(b) The product of a 4 x 2 matrix and a 2 x 3 matrix is a 4 x 3 matrix: 
amS (4)(6) + OGD DECIR E DES) 


ale E =2 a U E O T E 
Cece, 7 4 (0)(6) + (8)(-1)  (0X(-2) + (8)(-7)_———(O)(-3) + (8)(4) 
=i 1 CDO Nel) CTED DEAT EEE 

16 -64 20 

| 2 ge 

7 =e =e ee 

Eo =i 


Oo 


We say that a matrix is square if it has the same number of rows and columns. We can define the determinant 
of a square matrix with more than 3 rows by using expansion by minors in much the same manner as we did for 
3 x 3 matrix. This is what it looks like for a 4 x 4 matrix: 


a b è & di 


a2 bo C2 2 be C2 dz Az C2 dz a2 bz dz A2 bz C2 
3 by as a = A1 b; C3 d3 — bı a3 C3 ds + Cy |a3 bz d3 -dı a3 bs Eal. 

by ca da A4 Ca d4 Ai ba d; aa a 
da ba Ca d4 
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11.5. SUMMARY 


(Note: the “Rule of Sarrus” gimmick for remembering the formula for the determinant of a 3 x 3 matrix does not 
work for finding the determinant of larger matrices.) 


If you enjoy computer programming, try writing a program for finding the determinant of a 4 x 4 matrix. As 
a bigger challenge, try writing a program for finding the determinant of any n x n matrix. 


I __Exercises Ba 


3 4 
11.4.1 Find the product | -1 5 e S alk 
Gay 7 Slee 


11.4.2 Find the product (1) (-1 5 -3). 


11.4.3 Show that if v and w are vectors in n dimensions, then v : w = w- v. 


11.4.4 Show that for any scalar a and any vectors u, v, and w in n dimensions, then we have the following: 


(a) v- (aw) = (av)-w=av-w 


(b) u-(v+w)=u-vt+u-w 


11.4.5 Suppose A is a 2 x 3 matrix. Is there a matrix I such that IA = A for all A? If so, what is it? 


11.5 Summary 


a 

We denote a vector in three dimensions with an ordered triple | b |, which can also be written as ai + bj +ck, where 
c 

i, j, and k are unit vectors from the origin in the direction of the positive x-axis, y-axis, and z-axis, respectively. 

Addition, scalar multiplication, and norm are defined as follows: 


a x atx a ka x 
ely eo ey |, kibjļ=įkb}, y || = V2 + y? + 22, 
c Z c+zZ c ke z 


and the dot product is defined as 
at Wy 
Up J}: | Wo | =U1+ Wy +02: W2 + U3-W3. 
v3 w3 


The dot product has the following properties: 


e u-v=v-u 
e u- (av) =a(u-v) 

e u- (v+w)=u: v+u:w 
e v- v= |v? 


e v- w = |lv/|||wl|cos 0, where 9 is the angle between v and w 


E 
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A 3 x3 matrix represents a linear function that maps vectors in 3 dimensions to vectors in 3 dimensions. This 
mapping is illustrated as a product of the matrix and a vector according to the definition 


M1 A12 443 x 411X + A124 + 413Z 
az, A2 A23 y | = | AX + Ary + 423Z 
431 432 33 Z 431X + 4324 + 433Z 


For any matrix A, any scalar c, and any vectors v and w, we have A(cv) = c(Av) and A(v + w) = Av + Aw. 


We add two matrices by adding the corresponding components in the matrices, and define the product of two 
matrices as follows: 


a4, a2 a3 by, Dyn bis mibu + a42bo1 + ai3b3) anbi + aiban + a13b32 anubis + a12b23 + 413633 
Ax, A2 A23 ba bu bz | = | anbu +anbz + absz aznb1ı2 + azb2z2 + a23b32 az2b13 + a22b23 + 423b33 
431 432 433 b31 b32 b33 31011 + a32b21 + a33b31 431b12 + A32b22 + a33b32 a31b13 + a32b23 + a33b33 


Matrix multiplication is not commutative, but it is associative and distributive. 


Important: The 3 x3 matrix I such that AI =IA =A for any 3 x 3 matrix A is 


1 0 0 
ea i Q 
0 0 1 


We call this the identity matrix. 


Definition: We define the determinant of a 3 x 3 matrix as follows: 


C2 ADE 


a 
-b 2 


Ae 
A3 C3 a3 


= a,b2c3 + byc203 + c142b3 ni a1c2b3 = b4a2C3 = c1b283. 


We can also denote the determinant of a matrix A as det(A). 


We say a matrix is singular if its determinant is 0, and nonsingular if its determinant is nonzero. If det(A) # 0, 
then for any vector c, there is unique vector v such that Av = c. 


The determinant has the following properties: 


e If we produce matrix B by multiplying a row or a column of A by some constant k, then det(B) = k det(A). 
e For any 3 x 3 matrix A and any constant k, we have det(kA) = k’ det(A). 
e For any 3X3 matrix A, swapping the rows or columns of A produces a matrix with determinant (—1) det(A). 


e If one row of a 3 x 3 matrix is a constant times another row, or one column of the matrix is a constant times 
another column, then the determinant of the matrix is 0. 


e For any matrix M, the determinant of a matrix formed when we add a constant times any row of M to 
another row of M equals det(M). Similarly, the determinant of a matrix formed when we'add a constant 
times any column of M to another column of M equals det(M). 
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REVIEW PROBLEMS 


A minor of a matrix A, denoted as Ajj, is the determinant of the matrix that remains when row i and column j 
are omitted from matrix A. We can compute the determinant of A as 


det(A) = (-1) "an Aa + (1) apAin + (-1)'PagAa 
fori = 1,2, or 3, and as 

det(A) = (-1)!*/a;A1; + (-1)?agjA2j + (-1)?/a3jA3j 
for j = 1, 2, or 3. This technique is called expansion by minors. 


An matrix with r rows and c columns (an r X c matrix) represents a linear function that maps vectors with 
c components to vectors with r components. The product of an r x c matrix and a vector with c components is 
defined as follows: 


m m m m v1 M1101 + M1202 + M1303 + +++ + MicUe 
1 3 coe 
m ! a on ie U2 M2101 + M2202 + 112303 +++ + MV¢ 
21 22 ply OR 
: = 03 = M3101 + 113202 + 113303 + ++ + M3cVe 
My, Mnp m ooo m ; 
r r r3 rc Uc M101 + Mr + My,303 + +++ + MycVe 


Important: The product of an r x c matrix and a vector in n dimensions is defined if and. 


Y only if c = n. 


We can define the matrix sum A + B if and only if A and B have the same dimensions, in which case each entry 


in the sum A + B is the sum of the corresponding entries in the two matrices. 


Important: The product AB is defined if and only if the number of columns in A equals the 
Vv number of rows in B. If A is an r x s matrix and B is an s X c matrix, then the 


product AB is an r x c matrix. | 


Let the rows of A be aj, a2,...,a, and the columns of B be bı, b2,..., be. If the matrix product AB is defined, 
then the product is given by 


aı b; a1: bo a, -b3 ies ai’ be 

az: bı a b> a: b3 cca a: be 
AB= : ‘ : 

ay: bi acb a,--b3 © ap be 


REVIEW PROBLEMS — 


Unless stated otherwise, all vectors in the Review Problems and Challenge Problems are in 3 dimensions, and 
all matrices are 3 X 3 matrices. 


2 4 
11.29 Letu = 3 | andv= 5 
—5 —1 


(a) Find juli and |v]. 
(b) Find (2u + 3v) - (u - 3v). 
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4 -3 —4 -2 
11.30 Which two of the following four vectors are orthogonal: ( 5 ) ( 4 ) ( 2 ) ($) R 
-1 2 3 3 


11.31 Show that (x11 + x2y2 + x3y3) < (x2 + x3 + XZY? + y3 + y3). 


11.32 


4 5 -2 -1 
(a) Find the product | 1 -3 4 Ia 
0 1 -2 4 
-1 


(b) Is the matrix in part (a) the only matrix that can be multiplied by ( 1 ) to give your answer to part (a)? If 
4 


not, then find another. 
=| 
(c) Isthe vector | 1 | from part (a) the only vector that can be multiplied by the matrix given in part (a) to 
4 
get your answer to part (a)? 


11.33 


(a) If Av equals the vector 0, then must either A be 0 or v be 0? 
(b) If AB equals the matrix 0, then must either A or B be 0? 


4 -2 -1 = 2 © 
11.34 Find the product | 0 5 -1 =1 ð hk 
= O si ET) 


11.35 


(a) Is there a matrix P such that PA is the matrix that results when we swap the first two columns of A and 
leave the third column unchanged? 


(b) Is there a matrix P such that AP is the matrix that results when we swap the first two columns of A and 
leave the third column unchanged? 


11.36 If Av + Bv = 0 for all vectors v, then must A = —B? 


11.37 Compute the following two determinants: 


=| 1/3 1/5 6 
(Cae |= 3 2 (b) 4 -3 5 
-3 1 8 A e 0 


11.38 Use Cramer’s rule to solve the following system of equations: 


BG OA e DZ E =o; 
2x — Sy + 3z = 4, 
ER a Ly oa 


x-4 x-3 x-2 
11.39 Evaluate |x — 1 x x+1). 
x+2 x+3 x+4 


CHALLENGE PROBLEMS 


11.40 


(a) As described on page 399, the matrix A! is the transpose of A. Show that if A and B are 3 x3 matrices, then 
(AB) = B'AT. 


(b) We can extend the concept of a transpose matrix to r X c matrices as follows. Let A be an r x c matrix with 
row vectors, in order, of a), a2,...,a,. The transpose of A, denoted A7, is the c x r matrix whose columns, in 
order, are a1, a2,...,a,. Show that if AB is defined, then (AB)! = BTAT. 


4 0 2 

5 =l =í a 
11.41 Find the product QO 2}. 

3 -2 7 a 

-1 5 2 


11.42 IfA is 3 x 4 matrix, C is 3 x 6 matrix, and AB = C, then what are the dimensions of B? 


11.43 


(a) Show thatif the product of two square matrices is defined, then the two matrices have the same dimensions, 
and the product has the same dimensions as the original matrices. 


(b) Ifthe product AB equals a square matrix, then must A and B be square matrices? 


Challenge Problems 


11.44 Letu, v, and w be vectors such that each vector has magnitude 1 and each vector is orthogonal to the other 
two. Show that if r = au + bv + cw, then we must havea =r-u,b=r-v,andc=r-w. 


11.45 Letu = i+ j+ zk and v =2i-j + 3k. 


(a) Find all z such that the angle between u and v is 7/2. 

(b) Find all z such that the angle between u and v is 77/3. 

(c)* Find the maximum possible value of cos? @ if it is possible to find a value of z such that the angle between 
u and vis @. Hints: 34, 33 

a 

b2 


1 a 
11.46 Express|1 b as a product of three polynomials in a, b, and c with nonzero degree. Hints: 71 
1 c 


11.47 We say that a matrix M is skew-symmetric if it satisfies M” = -M. Show that if a 3 x 3 matrix M is 
skew-symmetric, then det(M) = 0. 


11.48 
(a) Show thatif A is a constant such that there exists a nonzero vector v for which Av = Av, then det(A— AI) = 0. 
Hints: 51 
-8 1 3 
(b) Let A= | -4 1 7 J. Find three values of A such that det(A — AI) = 0. These values are called the 
@ il =i 


eigenvalues of A. 


(c) For each value of A you found in part (b), find a nonzero vector v such that Av = Av. These vectors are 
called eigenvectors of A. 


r r = EE 


CHAPTER 11. VECTORS AND MATRICES IN THREE DIMENSIONS, PART 1 
11 0 

11.49 LetA=[{0O 1 1). FindA”. 
0 0 1 


11.50 Suppose A = (2, : 7) and B = z Py Is there a matrix M such that AM = B? If so, is it 


unique? 
11.51 Suppose that A and B are matrices such that AB = —BA. 


(a) If Aand B are 2 x 2 matrices, must at least one of A and B be singular? 


(b) If A and B are 3 x3 matrices, must at least one of A and B be singular? 


-2 4 -4 
11.52x LetA= ( 0 2 | . Prove that det(I — A2°*) is not equal to 0. Hints: 245, 123 
-8 -2 4 
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If there's no struggle, there's no progress. — Frederick Douglass 


2 
lll 


Vectors and Matrices in Three Dimensions, Part 2 


12.1 Lines and Planes in Three Dimensions 


As in two dimensions, a linear combination of a set of vectors v1, V2,...,Vn is a sum 
AIVI + AQV2 + +++ + AnVn 


of products of scalars and the vectors. In this section, we'll investigate the geometry of linear combinations of 
vectors in three dimensions. 


|| Problems ig 


Problem 12.1: In this problem, we explore using equations to describe lines in three dimensions. Let A be 
(3,4, —2) and B be (-1,5,0), and let £ be the line through A and B. 


(a) Suppose that C is a point on f besides A and B. How must AC and AB be related? 
(b) Show that if (x, y, z) is on £, then we must have 


ONORE 


for some value of t. Does substituting any real number t into this equation produce a point (x, y, z) that 
is on £? 


Problem 12.2: Let A be (-1,3, 2), B be (—3,5, -1), C be (0, —2, 8), and D be (4,1,7). Do AB and EB intersect? 
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Problem 12.3: In this problem, we investigate linear combinations of pairs of vectors. For each of the values of 
x 


v and w given below, describe the graph of all points (x, y, z) that satisfy (3) = sv + tw for some values of s 
Z 


and ż. 
(a) v=j,w=3j 
b) v=iw=j 
(c) v=i+k,w=k 


(d) Your answer to part (a) should be a different type of geometric object than your answers to parts (b) and 
(c). Why does this occur? 


Problem 12.4: 


xX =l 
(a) Describe the graph of the parametric equation () = ( 3 ) + si + tj, where s and t are parameters. 
Z 4 


x 
(b) How isthe graph in part (a) related to the graph of (3) = si + tj? 
z 


Problem 12.5: Suppose neither v nor w is a multiple of the other. Show that the graphs of the parametric 


equations 
ay 
= |b, | +siv+thw, 
Cy 


a2 
ba | +s2v + bhw, 
C2 


pe 
ee 
i 


Yee oN 
<a 
H 


are either the same or do not intersect. 


. (a) Suppose that C is a point on ¢ besides A and B. How must AC and AB be related? 
(b) Show that if (x, y,z) is on £, then we must have 


for some value of t. Does substituting any real number t into this equation produce a point (x, y, z) that 
is on €? 


Solution for Problem 12.1: 


(a) IfA, B, and C are collinear, then the angle between AC and AB must be either 0 or x, so AC must be a scalar 
multiple of AB. 


EE a TT 


12.1. LINES AND PLANES IN THREE DIMENSIONS 


= 


(b) LetCbe (x, y, 2). Since C is on €, we must have AC = tAB for some scalar t. Since AC = C—A and AB = B -—A, 
we have C = A + (B — A), or 


“= DEORE ORG ae 


Any point (x,y,z) on € must satisfy this equation for some value of t. Conversely, every value of t gives 


a point (x,y,z) on the line, because C is on £ whenever AC = tAB. Therefore, Equation (12.1) gives us 
parametric equations whose graph is a line in space: x = 3 — 4t, y = 4 + t, z = —2 + 2ł. 


0 


Our solution to Problem 12.1 should be familiar, since it’s the same process we used to describe a line in two 
dimensions in Section 9.3. We can use this same procedure to express any line in space with a parametric equation 


of the form 
x a 
ysl td 
Z c 


The point (a,b,c) is on the line and d is a direction vector of the line. Any vector between two distinct points on 
the line must be a scalar multiple of d. We also sometimes say that this direction vector is “parallel” to the line. 
We can think of the parametric equation as telling us, “To get to any point (x, y,z) on the graph, start at (a, b,c) 
and then travel in the direction of d (or the opposite direction).” Since every point in the graph besides (a,b,c) 
is the same (or opposite) direction from (a, b, c), the graph of this equation produces the line through (a,b,c) in 
direction d. 


Just as in two dimensions, a given line does not have a unique direction vector, but if dı and dz are both 
possible direction vectors for the same line, then dı = kdz for some nonzero scalar k. 


| Problem 12.2: Let A be (—1,3,2), B be (-3,5, —1), C be (0, -2, 8), and D be (4, 1,7). Do AB and ED intersect? 


Solution for Problem 12.2: We start by finding parametric forms for both lines. Proceeding as in the previous 
problem, for any point (x, y, z) on AB, we must have 


eet OROROIe Se 


So, AB is the graph of the parametric equations x = —1 — 2t, y = 3 + 2f,z = 2 — 3t. 


Turning to CO the point (x, y, z) is on ED if 


e A “nee 


Therefore, ED is the graph of the parametric equations x = 4u, y = -2 + 3u, z = 8 - u. 


Since all points on AB are of the form (-1 - 2t,3 + 2t,2 — 3t) for some £, and all points on ED are of the form 
(4u,2 + 3u,8 — u) for some u, lines 4B and CD intersect if and only if there are values of f and u for which these 
two points are the same. This gives us the system 


—1 -—2t = 4u, 
3+2łt=2 + 3u, 
2-3t=8-u. 
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Adding the first equation to the second gives 2 = 2 + 7u, from which we have u = 0. Substituting this into the first 
equation gives t = —1/2. However, (t, u) = (—1/2, 0) does not satisfy the third equation, so there is no solution to 


this system of equations. Therefore, AB and CD do not intersect. 0 


Now that we have a sense of the geometry that results from considering all multiples of a single vector in three 
dimensions, let’s take a look at linear combinations of pairs of vectors in three dimensions. 


Problem 12.3: For each of the values of v and w given below, describe the graph of all points (x, y, z) that 


x 
satisfy (3) = sv + tw for some values of s and t. 
z 


| (a) v=j,w=3j 
(b) v=i,w=j 
(c) v=i+tkw=k 


(d) Your answer to part (a) should be a different type of geometric object than your answers to parts (b) and 
(c). Why does this occur? 


Solution for Problem 12.3: 
(a) We have 


xX 0 
(3) = sv + tw = sj + 3tj = (s + 3t)j = (+) 
z 0 


Since s and t can take on any value, the expression s + 3t can take on any value. Therefore, every point on 
the y-axis is in the graph, and no other points are. So, the graph is the y-axis, which is a line. 


(b) We have 


xX s 
y | =sv+tw=si+tj=|t]. 
Z 0 


Since s and ¢ can take on any value, every point in the xy-plane appears in the graph. There are no points 
in the graph with nonzero z-coordinate, so the graph is the xy-plane. 


(c) Wehave 
X s 
y | =sv +tw =si+sk + tk = si + (s + t)k = OPE 
Z s+t 


To see that any point (a,0,b) in the xz-plane is in the graph, note that letting s = a and t = b — a in the 
equation above gives (x, y, z) = (a, 0, b). Therefore, the graph is the xz-plane. 


(d) Inthe first part, w is a multiple of v, and the resulting graph is a line. To check if the graph is always a line 
when w is a multiple of v, we let w = cv for some constant c. Then, we have 


x 
() = sv + tw = sv + tcv = (s + tc)v. 
Z 


Since we can choose any values of s and t, the expression (s + tc)v can be any multiple of v. Therefore, the 
graph is a line through the origin with direction vector v. 


In parts (b) and (c), w cannot be expressed as a constant times v, and the resulting graph in each case is 
a plane, not a line. O 


O_o a — SCSCSC~C~*~;3C3>ET SSS 


12.1. LINES AND PLANES IN THREE DIMENSIONS 


—— a a A a a I a 


Just as we did in two dimensions, we say that a pair of vectors v and w are linearly dependent if and only 
if there are nonzero scalars a, and az such that a,v + azw = 0. Otherwise, the vectors are linearly independent. 
Since we can write w = cv as cv + (-1)w = 0, our work in part (d) above shows that that the graph of 


x 
y | =sv+tw 
Z 


is a line if v and w are linearly dependent (and not both 0). 


Our work in parts (b) and (c) gives two examples in which v and w are linearly independent and the resulting 
graph fits our geometric concept of a plane. Let’s think a little bit about why the graph of all points corresponding 
to linear combinations of two linearly independent vectors matches our geometric idea of a plane. What follows 
is not intended to be a rigorous proof, but rather to give us some intuition for why our geometric idea of a plane 
fits with the graph of all linear combinations of two linearly independent vectors. We’ll start with the fact that 
any three noncollinear points in space define a plane—the plane that contains the triangle with those three points 
as vertices. 


Suppose we start with the three noncollinear points V, W, and the p 
origin, which we'll call O. We'll let v = Vandw = W, so v and w drawn 
from the origin are two sides of AVOW. Let P be the plane that contains 
this triangle, and let fy and fw be the lines through the origin that contain 
v and w, respectively. Then, lines fy and fw are in P. Since any multiple 
of v lies along fy and any multiple of w lies along £w, the vectors sv and 
tw are also in P. We locate sv +tw by completing the parallelogram with 
one vertex at the origin and sides sv and tw, as shown at right. Since 
the triangle with sides sv and tw is in P, so is this parallelogram, which 
means that sv + tw is in P, too. 


This gives us some idea why sv + tw must be in the same plane as 
v and w, but it doesn’t explain why every point in this plane is in the 


graph of 
x 
y | =sv+tw. (12.2) 
Z 


Suppose that Q is a point in plane P. If Q is on £y or fw, then we clearly 
have Q = sv + tw for some s and t. Otherwise, we draw lines through 
Q parallel to £y and fw, thereby completing a parallelogram OAQB, as 


shown. Since A is on fy and B is on fy, we have A = sv and B = tw for 
some constants s and t. Therefore, we have 


O=A+B=sv+tw, 


which means that Q is in the graph of (12.2). 


The graph of linear combinations of two linearly independent vectors so nicely matches our concept of a plane 
that we use it to define a plane algebraically. 
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Definition: Let v and w be linearly independent. The plane through the origin generated by v and w is the 
graph of the parametric equation 
x 
í s) = 5V + tw, 


Z 
| where s and t are parameters. | 


We will say that a vector v is in a plane ĦP if there is some pair of points in P such that the vector between the 
points is v. So, for example, the vector i + j is “in the xy-plane.” 


Our definition only covers planes through the origin. What about planes that don’t pass through the origin? 


Problem 12.4: 


x =k 
| (a) Describe the graph of the parametric equation (3) = ( 3 ) +si+tj. 


z 4 


a 


aS 
(b) How is the graph in part (a) related to the graph of (3) = sit tj? 
z 


Solution for Problem 12.4: 
(a) We have 


Since s and £ can take on any values, the first two coordinates of (~1 + s,3 + t, 4) can be anything. So, the 
graph consists of all the points in space with z = 4, which means that the graph is a plane parallel to the 
xy-plane and 4 units above the xy-plane. 


Another way to think about this is to start with the plane generated by i and j. Then, we form the graph 


x —1 
y|}=| 3 |+si+tj 
Zz 4 


=l 
by adding í 3 ) to every vector from the origin to a point in a plane generated by i and j. In other words, 
4 


of 


=l 
we translate the plane generated by i and j by the vector ( 3 , 
4 
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Yet another way to think about it is that we form the graph by starting 
from (—1,3,4), and then traveling by the vector si + tj to find (x, y,Z). 
Starting from a point and then considering the result of traveling by 
every possible linear combination of two linearly independent vectors 
is exactly how we (algebraically) defined a plane through the origin. 
The only difference here is that we are starting from (—1,3,4) instead of 
the origin. This observation inspires a definition of a plane defined by 
two linearly independent vectors and a point through which the plane 
passes: 


Definition: Let v and w be linearly independent. The plane through the point (a, b,c) generated by v and w 
is the graph of the parametric equation 


a 
+sv+tw, 


Notice that when we have (a, b,c) = (0,0, 0), this definition matches our earlier definition of a plane through 


where s and t are parameters. 


the origin. 
x 
(b) The graph of | y | = si+ tj is the xy-plane, as described in part (b) of Problem 12.3. This plane is the graph 
z 


of z = 0. The plane in part (a) is the graph of z = 4. We cannot have both z = 0 and z = 4, so the plane in 
this part and the plane in part (a) do not intersect. 


Oo 


We say that two planes are parallel if either they don’t intersect or they are the same plane. The planes 
in Problem 12.4 are generated by the same two vectors, and we found that the resulting planes are parallel. 
Geometrically, this makes sense. The vectors that generate a plane determine the “direction” of the plane, so it’s 
not surprising that the two planes generated by the same vectors are parallel. We can use our algebraic definition 
of a plane to prove that this is the case. 


Problem 12.5: Suppose v and w are linearly independent. Show that the graphs of the parametric equations 


X% a 
y |=| b ]|+sv+tw, (12.3) 
z c 
x a’ T 
=| 0 |+pv+qw, (12.4) f 
G 


Solution for Problem 12.5: The two graphs have a point in common if and only if there are scalars s1, t1, pı, qı such 


that ; 
a a 

a) +sıVv+hw= (2) + piv + qiw. (12.5) 
c E 
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If there are no such scalars, then the graphs do not intersect, and the graphs are parallel. All that’s left to show is 
that if such scalars exist, then the graphs are the same. 


For any point (xo, Yo, Zo) on (12.3), there are scalars sọ and tọ such that 


Xo a 
Yo | = b | +sov + tow. (12.6) 
ZO C 


To see that (xo, Yo, Zo) is on (12.4), we first rearrange (12.5) as 


a’ a 
() = (a) + (sı = pı)v ar (fy = qı)w. (12.7) 
& c 


Now, letting p = pı — $1 + So and q = qı — ti + to in (12.4) gives 


x a’ 
y]= b’ | + (pı — S1 + So)V + (qı —h+ to) Ww 
Z C 
a 
= b | + (s1 - piv + (t - qı)w + (pı — S1 + So)v + (qı — fi + to)w 
c 
a 
=D 1 + Sov + tow, 
Cc 


Xo 
which equals our expression for | yo | from (12.6). Therefore, (xo, yo, Zo) is in both graphs. Similarly, we can show 
Z 


0 
that any point in the graph of (12.4) is in the graph of (12.3), so the graphs of the two equations are the same. 


Since the planes that are the graphs of (12.3) and (12.4) either don’t intersect or are the same, they are parallel. 
Oo 


|| Exercises Ea 


12.1.1 Find vectors v and w such that 


x 
y |=v+wt 
Z 


describes the line through (—4, 5,0) and (—5, —1, 2). Are the vectors you found the only possible values of v and w? 
12.1.2 Find a vector that is parallel to the graph of x = 4- 2t, y =2+t,z=2-3t. 


12.1.3 Let P be the plane through (-2, —1, 4) generated by 2i — j + k and —3i + 2k. Show that (10, —4, 3) is on P, 
but (—1,5, 2) is not. 


12.1.4 Find x if xi + 7j + 10k is in the plane through the points (—1, 2, 4), (3, —1, 2), and (5, 1,6). 


12.2 More Planes in Three Dimensions 


Now that we have an algebraic definition of a plane, we can use algebraic tools on problems involving planes. In 
this section, we’ll explore these tools. We’ll also address the question of what happens when we consider linear 
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combinations of three vectors in three dimensions. 


|) _Problems ii 


Problem 12.6: Let v and w be linearly independent and let a and b be linearly independent. In this problem, 
we show that if v and w are in the plane through the origin generated by a and b, then the graphs of 


x x 
y | =sv+tw and y | =pa+qb 
Z Z 

are the same. 


(a) Show that for every choice of s and t, we can find values of p and q such that both equations produce the 
same point (x, y, z). Hints: 32, 274 


(b) Show that for every choice of p and q, we can find values of s and t such that both equations produce the 
same point (x, y, z). Hints: 5 


Problem 12.7: In this problem, we find an equation whose graph is the plane through the points (0,4, 0), 
(ae) and (2, —2, —4). 
(a) Find a parametric equation whose graph is the plane. 


(b) Show that there is an equation of the form ax + by + cz = d, where a, b, c, and d are constants, whose graph 
contains every point in the plane. 


Problem 12.8: In this problem, we show that the graph of nx + nzy + n3z = d is a plane, where 11, n2, nz, and d 
are constants and at least one of 111, nz, n3 is nonzero. 
(a) Suppose nı # 0. Solve the equation for x. Use the result to express the equation in a form that fits our 
algebraic definition of a plane. 


(b) Complete the proof by addressing the case n, = 0. 


Problem 12.9: Let v and w be linearly independent and let xo, yo, and zp be constants. 
(a) Show that we can always eliminate the parameters from the parametric equation 


x Xo 
y | =| Yo | +sv+tw 
Z Zo 


to produce an equation of the form mx + ngy + n3z = d, where nı, n2, ng, and d are constants. 


(b) Show that every point in the graph of the equation of the form nıx +n2y+n3z = d produced by eliminating 
the parameters is in the graph of the original parametric equation. 


Problem 12.10: In this problem, we find a vector that is orthogonal to every vector in the plane through the 
origin generated by v = 3i + 4j — k and w = 4i - 2j + 3k. 
(a) Find an equation of the form mx + nzy + n3z = 0 whose graph is the plane through the origin generated 
by v and w. 


(b) Letn = mi + mj + 13k. Show that n is orthogonal to both v and w. 


(c) Isn orthogonal to every vector in the plane? 
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Problem 12.11: In Problem 12.7, you found the equation of a plane through the points (0,4, 0), (3,-1,-3), and 
(2,-2,—4). This answer should be of the form nx + nzy + n3z = d, where nı, nz, ng, and d are constants. Is 
n = mi + mj + nak orthogonal to the vector between any two points in the plane? 


Problem 12.12: Show that the vector 
n= n;i + Nj ap n3k 


is orthogonal to the vector between any two points in the graph of 
NX + my +N3z =d, 


where nı, nz, na, and d are constants. 


Problem 12.13: 
(a) Find an expression for proj,,v in terms of v - w, ||w||, and w. 


(b) The distance between a point and a plane is the length of the shortest segment between the point and 
a point in the plane. Find an expression for the distance between the point (xo, yo, zo) and the plane 
ax + by + cz +d = 0, where a, b, c, and d are constants. 


Problem 12.14: Let u, v, and w be vectors in three dimensions such that v and w are linearly independent. We 
will explore the graph of the parametric equation 


K 
C) =ru + sv + tw. (12.8) 
z 


(a) Show that if u is a linear combination of v and w, then the graph of Equation (12.8) is a plane. 


(b) Suppose that u is not a linear combination of v and w. Then, what is the graph of 


x 
(3) =u +sv +tw? 
z 


(Notice that the coefficient of u here is 1, not r.) 


(c) Suppose that u is not a linear combination of v and w. Then, what is the graph of 


x 
(3) = 2u + sv + tw? 
Z 


How is your answer related to the graph in part (b)? 


Suppose that u is not a linear combination of v and w. What is the graph of Equation (12.8)? 


Problem 12.15: We say that vı, v2, and v3 are linearly dependent if there exist constants 41, a2, and a3, not all 0, 
such that avı + a2v2 + 43v3 = 0. Otherwise, we say the vectors are linearly independent. Show that if u, v, and 
w are linearly independent, then for any (x, y, z), then there is exactly one ordered triple (r,s, t) such that 


x 
() =rut+sv+tw. 
Z 
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Problem 12.6: Let v and w be linearly independent and let a and b be linearly independent. Show that if v 
and w are in the plane through the origin generated by a and b, then the graphs of 


x 
) ssn and OEST 
Z 


Solution for Problem 12.6: We must show that every point in the graph of 


ne 
(3) = sv + tw, (12.9) 

Z 

x 

y | =pa+gpb, (12.10) 

Z 


is also in the graph of 


and vice versa. 


We'll first show that every point in the graph of (12.9) is on the graph of (12.10). We do so by showing that for 
any choice of s and t in (12.9), we can find values of p and q in (12.10) that produce the same (x, y, 2). 


We are given that v and w are in the graph of (12.10), so there are scalars p1, q1, p2, q2 such that 


v=pa+ qıb, 
w = pa + gob. 


Substituting these into (12.9) gives 


X 
(3) = s(pja + qib) + t(p2a + q2b) = (spi + tp2)a + (Sq + tq2)b. (12.11) 
z 


So, for any choice of s and t in (12.9), letting p = sp; + tp2 and q = sq; + tq2 in (12.10) produces the same (x, y, 2). 
Therefore, every point in the graph of (12.9) is also in the graph of (12.10). 


We must now show that we can go the other way. Specifically, we must show that for any choice of p and q 
in (12.10), we can find s and t in (12.9) that produces the same (x, y,z). Equation (12.11) gives us a good starting 
point. The graph of Equation (12.11) is the same as the graph of Equation (12.9). So, if for any p and q, we can find 
s and t such that 


p = spi + tpz, 
q = sqı + t2, 
then the graphs of (12.9) and (12.10) are the same. This system has a solution (s, t) for any (p,q) if and only if 


pı P2 # 0. To show that this determinant is nonzero, we go back to the definitions of p1, q1, P2, q2. These are the 


1 
scalars such that 


v=pa+qb, 
w = pa + gob. 
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pı p2 
qı 2 


v and w is a multiple of the other, which we can’t have because v and w are linearly independent. Therefore, 


If 


= 0, then one column of i a : 3) must be a multiple of the other. But this would mean that one of 


we conclude that p : j ] + 0, which means that for every pair (p,q), we can find constants s and t such that 
1 42 


spi + tp2 = p and sq; + tqz = q. So, for every choice of p and q in (12.10), we can find values of s and t in (12.9) that 
produce the same (x, y, Z). 


We have therefore shown that every point in the graph of (12.9) is in the graph of (12.10), and vice versa. We 
conclude that the graphs of these equations are the same. In other words, the plane generated by v and w is the 
same as the plane generated by a and b. O 


As a key step in our solution to Problem 12.6, we proved that if linearly independent vectors v and w are both 
in the plane generated by linearly independent vectors a and b, then a and b are in the plane generated by v and 
w. This statement is equivalent to the following: 


Important: If linearly independent vectors v and w are both linear combinations of linearly 


independent vectors a and b, then a and b are both linear combinations of v 
and w. 


Since we can choose v and w to be any two linearly independent vectors in the plane generated by a and b, 
our work in Problem 12.6 also tells us that: 


Important: 


Any two linearly independent vectors in a plane can be used to generate the 
plane. 


Also, there’s nothing special about planes through the origin. We could have used the same steps we used in 
Problem 12.6 to show that any two linearly independent vectors in a plane P through a point (a,b,c) generate P. 


Problem 12.7: In this problem, we find an equation whose graph is the plane through the points (0,4, 0), 
(3, —1, —3), and (2, -2, —4). 


(a) Find a parametric equation whose graph is the plane. 


(b) Show that there is an equation of the form ax + by + cz = d, where a, b, c, and d are constants, whose 
graph contains every point in the plane. 


Solution for Problem 12.7: 


(a) We know how to find equations for a plane given a point in the plane and two vectors contained by the 
plane. We can use the three given points to produce the two needed vectors. Let (0,4,0) be A, let point 
(3, -1, -3) be B, and let (2, -2, -4) be C. Then, the plane is the graph of 


x >.<: n 0 a=) 2-0 0 3 2 
y | =SA+sAB+tAC=]| 4] 4+5( -1-4 | +8) 2-4) = | 4) Pe —5]+t| -6 ). 
z 0 ==) == 0 =8 —4 


0 3 2 

We could have replaced £) with (3) ; (3) , or any other vector corresponding to a point in the 
0 =2 —4 

plane. i 
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(b) We can write our parametric equation as a system of equations: 


x= 3542, 
y = 4-—5s — 6, 
z = —3s — 4t. 


We produce an equation involving only x, y, and z by eliminating the parameters. Adding the first and 
third equations gives x +z = —2¢. Subtracting 5 times the third equation from 3 times the second gives 
3y —5z = 12 + 2t. Adding x + z = —2t to 3y — 5z = 12 + 2t gives 


x + 3y — 4z = 12. 


Our solution to Problem 12.7 doesn’t show that the graph of the parametric equation is the same as the graph 
of x + 3y — 4z = 12. We only showed that every point in the graph of the parametric equation is in the graph of 
x + 3y — 4z = 12. We didn’t show that every point in the graph of x + 3y — 4z = 12 is in the graph of the parametric 
equations. We’ll defer that proof to Problem 12.9, where we tackle the general case. 


Back in Chapter 9, we showed that the graph in two dimensions of any equation of the form ax + by = c is a 
line, and that every line in two dimensions is the graph of an equation of this form. Our work in Problem 12.7 
suggests we might be able to prove a similar statement about planes in three dimensions. 


In the next two problems, we show that every plane in three dimensions is the graph of an equation of the 
form nix +nzy +nsz = d. We do so by first showing that the graph of every equation of the form mx + my +n3z = d 
is the same as the graph of some equation of the form 


x Xo 
y}= | yo | +svt+tw, (12.12) 
Z Zo 


where v and w are linearly independent. Then, we use this fact to show that the graph of any equation of the form 
(12.12) is the same as the graph of some equation of the form mx + nzy + n3z = d. Make sure you see why both of 
these steps are necessary—the first tells us that the graph of mx + nay + n3z = d is always a plane, and the second 
tells us that every plane is the graph of some equation of this form. 


We go through this trouble for several reasons. Sometimes equations of the form mx + nz2y + n3z = d are much 
more convenient to work with than equations of the form (12.12). More importantly, we will use these forms 
together to explore graphs of linear combinations of three linearly independent vectors. Finally, in Section 12.3, 
we'll use them to discover a very useful operation for vectors in three dimensions. 


(<= =n een 
‘Problem 12.8: Show that the graph of mx + nzy + naz = disa plane, where nı, n2, n3, and d are constants and 
lat least one of nı, 12, n3 is nonzero. 


Solution for Problem 12.8: To show that the graph must be a plane, we must show that the graph fits our algebraic 
definition of a plane. This means we have to show that it is the graph of a parametric equation of the form 


x a 
y |=| b |+sv+tw 
z c 


for some linearly independent vectors v and w and some constants a, b, and c. To see how, let’s focus on x in the 
parametric form. We wish to have x = a + sv; + tw, where v; and w are the first components of v and w. On the 
right side of the parametric equation, only s and f can vary—4, vı, and w; are constants. So, we seek an equation 
of the form x = a + sv; + fw, in which x is expressed in terms of two variables. 
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If x + 0, we can solve the given equation nx + nzy + nsz = d for x in terms of the variables y and z, which gives 
us 


So, we can simply let y = s and z = t be the parameters. Therefore, the graph of nx + nay + nsz = d is the graph of 


X d/n =n/n =n3/nı 
vsi ees 1 +t 0 : (12.13) 
Z 0 0 1 


=n2/n —13/1 
The vectors i and 0 are linearly independent, since the only way to make the last two 
0 1 


—n2/m -n/n 
” components of s 1 +t 0 equal 0 is to let s = t = 0. Therefore, the graph of (12.13) is a plane. 
0 1 


If 1, = 0, then at least one of n and nz is nonzero. We can solve 1\x+n2y+n3z = d for the variable corresponding 
to the nonzero coefficient just as we solved for x above. Then, we can use the other two variables as the parameters, 
just as we used y and z as parameters above. So, in this case, we can still express the equation in a parametric 
form matching our algebraic definition of a plane. O 


Important: Ifa, b, c, and d are constants and at least one of a, b, and c is nonzero, then the 


Vv graph of ax + by + cz = d is a plane. 


Problem 12.9: Let v and w be linearly independent and let xo, yo, and zo be constants. — 
(a) Show that we can always eliminate the parameters from the parametric equation 


x Xo 
y | =| yo | +svt+tw 
Z ZO 


Myx + ny + 13z =d, 


to produce an equation of the form 


where nı, m, 13, and d are constants. 


(b) Show that every point in the graph of the equation of the form 1,x+n2y+n3z = d produced by eliminating 
the parameters is in the graph of the original parametric equation. 


Solution for Problem 12.9: 


(a) We write the parametric equation as the system 


X = Xo +80, + tw, 
Y = Yo + Sv + two, 
Z = Zo + SV3 + tw3. 


We could eliminate the parameters one at a time, but instead, we’ll be a little more clever, and try to 
eliminate them both at once. We think to do this by noticing that the last two terms in each of the equations 
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above are a dot product: 


This is nice because it allows us to deal with both s and t at the same time—if we eliminate ( ) from the 
system, then we have eliminated both parameters. We now wish to show that there are constants 11, n2, and 
nz, not all 0, such that we can multiply the first equation by nı, the second by mz, and the third by nz, add 
the three resulting equations, and thereby eliminate both parameters. After we perform this multiplication 
and addition, we have 


S Vy s V2 S U3 
4X + N2Y + N3Z = N1Xo + N2Yo + N3Zo + nı i . w +n t : w +3 t 2 ws 


= mxo + mayo + naza + (8) - (m (2) +m (2) +m (2Y) 
= 1X0 2Yo 320 i 1 w 2 (w 3 (ws . 


sS\. v U Os var 
) if we can choose nı, 12, n3, not all 0, such that nı ( A +n ( A +3 ( ` is 0. 
t w1 W2 W3 


Because any three vectors in two dimensions are linearly dependent, we are guaranteed to be able to find 


We can eliminate ( 


U U v 7 
scalars 11, nz, n3, not all 0, such that nı ee nz a n3 3) = 0. Lettin nı, nz, nz take on these values 
WwW W w3 & 


eliminates the dot product and leaves us 
MX + N2Yy + NgZ = N1Xo + N2Yo + N3Zo. (12.14) 


Note that the right side of this equation isn’t surprising because the point (xo, yo, Zo) must be on the plane. 


Letting d = 1X9 + N2Yo + 1329 in Equation (12.14) completes our proof that we can always eliminate the 
parameters to produce an equation of the form mx + nzy + n3z = d, where nı, n, ng, and d are constants. 


(b) Our work in part (a) shows that we can eliminate the parameters from a parametric equation of the form 


x XO 
y | =| yo | +sv+tw (12.15) 
Zz Zo 


to produce an equation of the form 11x + ny + n3z = d. This means that every point in the graph of the 
parametric equation is in the graph of nıx + my + 3z = d. But we haven’t shown that every point in the 
graph of x + n2y +nsz = dis in the graph of the parametric equation, so we don’t yet know that the graphs 
are the same. In Problem 12.8, we showed that we can go from an equation of the form mx + nay + naz = d 
back to a parametric equation. Specifically, the graph of nıx + ny + n3z = d is the same as the graph of the 
parametric equation 


x X1 
y |=| yı | +pa+gb (12.16) 
Z Z1 


for some point (x1, y1, Z1) and some linearly independent vectors a and b. Now our problem is to show that 
the graphs of (12.15) and (12.16) are the same. 
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We know that every point in the graph of (12.15) is in the graph of (12.16). Specifically, we know that 
(xo, Yo, Zo) is in the graph of (12.16), so by Problem 12.5, the graph of (12.16) is the same as the graph of 


x Xo 
y | =| yo | +pa +gb. (12.17) 
Z Zo 


Because every point in the graph of (12.15) is in the graph of (12.17), we know that there are two points in 
(12.17) such that the vector between them is v. In other words v is in plane (12.17), as is w. Our work in 
Problem 12.6 then tells us that the plane generated by a and b is the same as the plane generated by v and 
w. We therefore conclude that the graphs of (12.15) and (12.17) are the same, which means that the graphs 
of (12.15) and nyx + nzy + n3z = d are the same. 


At long last, we have proved the following: 


_ Every plane is the graph of an equation of the form ax + by + cz = d, where a, b, - 


Important: 
- c, and d are constants. 


Make sure you see why this statement is different from, “The graph of the equation ax + by + cz = d, where a, 
b, c, and d are constants, is a plane.” 


Problem 12.10: In this problem, we find a vector that is orthogonal to every vector in the plane through the 
origin generated by v = 3i + 4j — k and w = 4i — 2j + 3k. 
(a) Find an equation of the form nx + nzy + n3z = 0 whose graph is the plane through the origin generated 
by v and w. 


Let n = mi + mj + n3k. Show that n is orthogonal to both v and w. 


Is n orthogonal to every vector in the plane? 


Solution for Problem 12.10: 


(a) The plane is the graph of 


x 3 4 
y | =sv+tw=s| 4 eet | -2 ]. 
Z = 3 


So, we have 


x= 3s+4t, 
y=4s-2t, 
z=-s+3t. 


Adding 3 times the equation for z to the equation for x gives x + 3z = 13t. Adding 4 times the equation 
for z to the equation for y gives y + 4z = 10t. Subtracting 13 times y + 4z = 10t from 10 times x + 3z = 13t 
eliminates t and leaves 

10x = 13y — 222 = 0. 


(b) Letting n = 10i — 13j — 22k, we have 


n - v = (10)(3) + (-13)(4) + (-22)(-1) = 0, 
n- w = (10)(4) + (-13)(—2) + (-22)(3) = 0, 


so n is orthogonal to both v and w. 
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(c) Let u be a vector from the origin to a point in the plane. By definition, such a vector must be a linear 
combination of v and w, so we have u = sv + tw for some constants s and t. Therefore, we have 


n-u=n-: (sv + tw) =n- (sv) +n- (fw) =sn-v+in-w=0, 


which tells us that n is orthogonal to every vector in the plane through the origin generated by v and w. 


The plane in Problem 12.10 passes through the origin. Let’s take a look at a plane that doesn’t pass through 
the origin. 


Problem 12.11: In Problem 12.7, we found that the plane through the points (0,4, 0), (3, —1, —3), and (2, —2, —4) 
is the graph of the equation x + 3y — 4z = 12. Let n = i+ 3j — 4k, where the components of n are taken from the 


coefficients of x + 3y — 4z = 12. Is n orthogonal to the vector between any two points in the plane? 


Solution for Problem 12.11: Let (x1, y1,21) and (x2, Y2,22) be two points in the plane, so 


X + 3Y1 — 4z; = 12, 
mata 3Y2 = 4Z = 12. 


The vector from the first point to the second is (x2 — x1)i + (y2 — y1)j + (Z2 — zı)k. These differences appear if we 
subtract the first equation above from the second equation. This subtraction gives 


(x2 — x1) + 3(y2 — yi) — 4(22 — z1) = 0. 


The expression on the left is (i + 3j — 4k) - ((x2 — x1)i + (y2 — y1)j + (22 — 21)k). Since this equals 0, the vector 
n = i + 3j — 4k is orthogonal to any vector between the two points in the plane. O 


We have confirmed that the vector ni + n2j + ngk is orthogonal to every vector in the plane 11x + nay + naz = d 
for two different planes. Our solution to Problem 12.11 gives us a guide to prove that this is the case for all planes. 


Problem 12.12: Show that the vector n = ni + Nj + nak is orthogonal to the vector between any two points 


in the graph of mx + nay + n3z = d, where ny, nz, ng, and d are constants. 


Solution for Problem 12.12: Let (x1, y1, z1) and (x2, y2, z2) be two points in the plane, so 


NyX1 + n241 + n3Zı =d, 
1X2 + ny2 + N3Z2 = d. 


Subtracting the first equation from the second gives 
Ny (X2 — x1) + n2(y2 — yi) + n3(z2 - z1) = 0, 


which we can write as 
(nii + mj + nk): ((x2 — x1)i + (y2 — y1)j + (z2 — Z1)k) = 0. 


Therefore, n is orthogonal to any vector between two points in the plane. 


Note that this isn’t the only vector that is orthogonal to every vector in the plane generated by v and w. We 
can multiply the equation above by any scalar k to yield 


(knyi + knj + knak) - (x2 — x1) + (y2 — y1)j + (z2 — 21)k) = 0, 


which tells us that kn is orthogonal to any vector in the plane, as well. Specifically, there are two directions that 
are normal to the plane, the direction of n and the direction opposite that of n, which is the direction of —n. O 
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Important: 


Vv 


This should look familiar—we showed back on page 325 that in two dimensions, the vector ai + bj is normal to 
the line ax + by = c. We then used that normal vector to find a formula for the distance between a point and a line. 
Let’s see if we can do something similar to find the distance between a point and a plane in three dimensions. 


A vector that is orthogonal to all vectors in a certain plane is said to be normal 
to the plane. The vector ai + bj + ck is normal to the plane ax + by + cz = d. | 


Problem 12.13: 
(a) Find an expression for proj,,v in terms of v - w, ||w||, and w. 


(b) The distance between a point and a plane is the length of the shortest segment between the point and | 
a point in the plane. Find an expression for the distance between the point (xo, yo, Zo) and the plane 
ax + by + cz +d = 0, where a, b, c, and d are constants. 


Solution for Problem 12.13: 


(a) Flip back to the solution to Problem 9.26 on page 327 and read through it closely, looking for any reference 
to the fact that the vectors are in two dimensions. You won’t find any. The entire proof holds for vectors in 
three dimensions, as well. So, we don’t have to do any more work to see that: 


Important: For any vector v and any nonzero vector w, we have = | i | 


; vw 
proj, Y = — zW. 
A 


(b) Looking back at two dimensions saves us a whole lot of trouble 
on this part, too. Rather than start from scratch, we do exactly 
what we did in Problem 9.26 in two dimensions. Let point A with 
coordinates (x,y,z) be on the plane, so that ax + by + cz +d = 0. 
Letting P be (xo, yo, Zo), we find the distance from P to the plane by 
finding the length of the projection of AP onto the vector normal 
to the plane. The vector n = ai + bj + ck is normal to the plane, so 


the desired length is 
AP-n || |AP-n 
zn = z | linll 
[In| [In| 
Inl? 
a aroe a y al 


linl 


Since ax + by + cz + d = 0, we have —ax — by — cz = d. Moreover, ||n|| = Va? + b2 + c2, so the desired distance 
between (xo, yo, Zo) and ax + by + cz + d = O is 


laxo + byo + czo + d| 


VE a Shee 


Now that we understand the graph of linear combinations of two vectors, let’s take a look at linear combinations 
of three vectors. 
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Problem 12.14: Let u, v, and w be vectors in three dimensions such that v and w are linearly independent. 
We will explore the graph of the parametric equation 


xX 
() =ru + sv + tw. (12.18) } 
Z 


(a) Show that if u is a linear combination of v and w, then the graph of Equation (12.18) is a plane. 


(b) Suppose that u is not a linear combination of v and w. Then, what is the graph of 


x 
(3) = u + sv + tw? 
Z 


(Notice that the coefficient of u here is 1, not r.) 


(c) Suppose that u is not a linear combination of v and w. Then, what is the graph of 


x 
(3) = 2u + sv + tw? 
Z 


How is your answer related to the graph in part (b)? 


(d) Suppose that u is not a linear combination of v and w. Then, what is the graph of Equation (12.18)? 


Solution for Problem 12.14: 


(a) Tfu is a linear combination of v and w, then u = pv + qw for some constants p and q. Then, Equation (12.18) 
becomes 


x 
(3) = ru + sv + tw = r(pv + qw) + sv + tw = (rp +s)v + (rg + t)w. 
z 


So, the graph consists of all linear combinations of v and w, which is the plane through the origin generated 
by v and w. 


uy 
(b) Letu= (=) . Then, the graph of 


U3 
x uy 
y | =u+sv+tw= | u2 | +sv+tw 
Z u3 


is the plane through (u1, u2, u3) generated by v and w. 


(c) The graph of 
x 2u 
y | =2u+sv+tw= | 2u | +sv+tw 
Z 2u3 


is the plane through (2u1, 2u2, 2u3) generated by v and w. As shown in Problem 12.5, this plane is parallel 
to the plane in part (b) because this plane is generated by the same vectors as the plane in part (b). To show 
that the planes in these two parts are not the same plane, we will show that it is impossible for the two 
planes to intersect. If they do intersect, then there must be some point (x, y, z) on both, so there must be 
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(d) 


scalars $1, ty, S2, t2 such that 


uy 2u 
(=) +sıv+ tiw = (zx) + Sov + fow. 
U3 2u3 
Rearranging this equation gives 
u 
(=) = (S1 — S2)v + (fi — f2)w. 
u3 


But this equation says that that u is a linear combination of v and w, which we know is not the case. 
Therefore, the plane in part (b) cannot intersect the plane in this part. 


Continuing in the vein of part (b) and part (c), for each possible value of r, the graph of 


x 
(3) = ru + sv + tw. (12.19) 
z 


is a plane through (ruı, ruz, ruz) generated by v and w. Moreover, for the same reason explained in part (c), 
all of these planes are parallel. So, by varying r, we produce a set of parallel planes. Intuitively, it seems 
like every point in space is included in one of these planes, but we have to prove it. 


We'll start by getting a little more geometric intuition for why 
each point is in one of these planes. In the diagram at right, 
plane P is the plane through the origin O generated by v and w. 
Obviously, the graph of (12.19) passes through all points in P, 
since P is the graph of (12.19) when r = 0. 


To show that the graph of (12.19) passes through any point A 
not in P, we consider the plane Q through A parallel to P. Since 
u, v, and w are linearly independent, when we draw u from the 
origin, it is not in P. Therefore, the line through the origin with direction u intersects Q at some point U, so 
U = ru for some r. Let B be the point in P such that OB Il {TA and OB = UA, so UOBA is a parallelogram 
(unless U is A, in which case A is obviously in the graph of (12.19)). 


B 


Since B is in P, we have B = sv + tw for some s and t. Since UOBA is a parallelogram, we have 
A = Ü + Ë = ru + sv + tw, which means A is in the graph of (12.19). 


This gives some geometric intuition for why the graph of 


x 
(5) =ru +sv+ tw. (12.20) 
Z, 


is all of R?. We’ll now prove that every point (x, y, Z) is on the graph. 


Let ax + by + cz = 0 be the plane P through the origin generated by v and w. Any point (xo, yo, Zo) 
in space is on a plane parallel to P, since for kọ = axo + byo + czo, the point (xo, yo, zo) is on the graph of 
ax + by + cz = ko. All we have left is to show that this plane is among the set of parallel planes produced by 
graphing (12.20). To do so, we just have to show that some multiple of u produces the plane ax+ by+cz = ko. 
Letting ky = au, + buz + cug, the point (u1, u2, u3) is on the graph of ax + by + cz = ky. Since u, v and w are 
linearly independent, we know that u is not in P, so we must have k, # 0. Therefore, we can multiply 
au, + buz + cu3 = ky by £ to give 


afm- 2) +6(u2- 2) +e (u E) = [y 
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which means that (u . ey u2: > u3- 2) is in the graph of ax + by + cz = kọ. This tells us that the graph of 
ax + by + cz = kg is the same as the graph of 


ae 

ko 
y | = —u +sv + tw, 
z ku 


which means (xo, Yo, Zo) is in the graph of (12.20). Therefore, every point in space is in the graph of (12.20). 
E 


Problem 12.14 allows us to extend our notion of linear dependence and linear independence to three vectors 
in three dimensions. We say that v1, vz, and v3 are linearly dependent if and only if there exist constants 41, a2, 
and 43, not all 0, such that a1v1 + a2v2 + 43v3 = 0. Otherwise, we say the vectors are linearly independent. Our 
work in part (d) shows that if u, v, and w are linearly independent, then the graph of 


X 
(3) = ru + sv + tw 
Z 


is all possible points (x, y, z). More succinctly, we say that this graph is R°. We also now know that there’s nothing 
to be gained by throwing more vectors into the mix—we can get all of R? with linear combinations of three linearly 
independent vectors, so we can’t get “more” by taking linear combinations of more than three vectors. 


Problem 12.15: Show that if u, v, and w are linearly independent, then for any (x, y,z), there is exactly one 
ordered triple (r,s, t) such that 


x 
í ) = ru + sv + tw. 
Z 


Solution for Problem 12.15: We know that there is at least one such ordered triple, since the graph of the parametric 
equation is R?. Suppose we have both 


x x 
(:) =rut+svt+t)w and () = rut svt tow. 
Z Z 


Subtracting the second equation from the first gives 


0 = (r1 — r2)u + (s1 — S2)v + (ty — t2)w. 


Since u, v, and w are linearly independent, we must have 1 — r2 = $1 -52 = t1 -t2 = 0,801) = 12,5; = S2,and fy = fp. 
Therefore, there cannot be two different ordered triples (r,s, t) that satisfy the equation for the same (x, y,z). Since 
we have showed that there is at least one ordered triple (r,s, t) that satisfies the problem, but there can’t be more 
than one, we conclude that there is exactly one such ordered triple. 0 


exercises B 


12.2.1 Find a vector with magnitude 1 that is normal to the graph of 4x — 3y + 12z = 5. 


12.2.2 Letn = ni + mj + nk. Show that if v is a nonzero vector such that n - v = 0, then there are two points A 
and B on the plane 11x + n2y + naz = d such that AB =v. 


12.2.3 Find the projection of 3i + 2j — 2k onto 4i — j + 8k. 
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3 
12.2.4 Find band cif | b | is normal to the graph of 2x — y+ 3x = 5. 
c 


12.2.5 Find all values of b such that the distance between (3, b, —5) and the plane x + 2y — 2z = 8 is 8. 


12.2.6 We can extend our definition of linear dependence to any numter of vectors. Specifically, the vectors 
V1; Vz, - - : , Vn are linearly dependent if and only if there exist constants 41, A2, - . - , An such that a] Vj +A2V2 +- * -+A Vn = 0 
and not all of the a; are 0. Otherwise, the vectors are linearly independent. Show that any set of four or more 
vectors in three dimensions is linearly dependent. 


12.3 The Cross Product 


In Problem 12.10 of the previous section, we found a vector n that is orthogonal to both v = 3i + 4j — k and 
w = 4i — 2j + 3k. We did so by showing that the plane through the origin generated by v and w is the graph of 
10x -- 13y — 22z = 0. We then showed that the vector n = 10i — 13j — 22k is orthogonal to any vector between two 
points in the plane. Specifically, n is orthogonal to both v and w. 


We start this section by generalizing this process, showing that we can go straight from the linearly independent 
vectors to the normal vector without bothering with the intermediate step of finding the plane generated by the 
linearly independent vectors. While doing so, we will discover a vector operation, the cross product, that we use 
in a great many applications of mathematics to physics and engineering. 


Problem 12.16: Find a nonzero vector that is orthogonal to linearly independent vectors v =yi+ vj + 03k 
and w = wi + Wj + wsk, in terms of the components of v and w. 


Solution for Problem 12.16: We can use our solution to Problem 12.10 as a guide. There, found an equation of the 
form nx + n2y + n3z = 0 whose graph is the plane through the origin generated by v and w. So, we start with the 
plane through the origin generated by v = vi + vzj + v3k and w = wii + Wj + w3k. Parametric equations whose 
graph is this plane are 

xXx=118S+ WE, 

Yy = v25 + Wot, 

Z = 035 + Ws, 
where s and t are parameters. We produce an equation of the form 1 x + n2y + n3z = 0 with the same graph as the 
parametric equations by eliminating the parameters. We first eliminate s by subtracting vz times the first equation 
from vı times the second equation, and by subtracting vs times the first equation from v; times the third equation. 
This gives us 

—U2X + UY = -VW iE + V{Wyt = (vW — vw )t, 


—U3X + 01Z = —03W1E + 01 W3t = (viw = vzw )t. 
Next, we set up elimination of t by multiplying the first equation above by (v;w3 — v3w1) and the second by 
(viw = U2W}): 


—02(01W3 — 03W1)xX + 01 (U1 W3 — V3W1)Y = (V1 W2 — V2W)(V1w3 — 03W)E, 


—03(01 W2 — VWI )x + 01 (vi W2 — 02W1 )z = (vi Wi => v2w1)(v1 W3 — vzw )t. 
Subtracting the first equation from the second eliminates t and leaves 
(—03(01 W2 — 02W1) + V2(v1W3 — v3W1)) X — V1 (01 Ws — V3W1)Y + 1 (V1 W2 — v2W1)z = 0. 


aE 
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Fortunately, the coefficient of x can be simplified a bit: 


—03(01 w2 = U2W}) + 02(01W3 = U3W}) = —0103W2 + V2V3W1 + 0102W3 — 0203W, = —0103W2 + U102W3 = 01(v2W3 = v3wW2). 
This leaves us 
01 (V2W3 = V3W )X = 01(01W3 os 03W1)Y + 01(01W2 = V2W})Z = 0 
If vı is nonzero, then we can divide both sides of the equation by v: to get 
(v2W3 = 03W2)Xx = (v1W3 = V3W1)Y F (viw = U2W})zZ =10) (12.21) 


(If vı = 0, we can go through the steps above slightly differently to produce this same equation. We eliminated s 
by subtracting multiples of our parametric equation for x from the parametric equations for y and z. If we started 
instead by subtracting multiples of the parametric equation for y from the equations for x and z, we'd end up with 
v2 times (12.21) rather than v; times it.) 


Equation (12.21) gives us the desired equation of the form ax + by + cz = d whose graph is the plane through 
the origin generated by v and w. We can read the components of a vector that is normal to the plane generated by 
v and w from the coefficients of x, y, and z. Therefore, the vector 


(v2W3 = U3W2)i = (v1W3 = 03W)j + (viw = v:w1ı)k 


is orthogonal to every vector in the plane generated by v and w. Specifically, itis orthogonal to v and w themselves. 
E 


For any two vectors v = vi + v2j + vsk and w = wi + w2j + wsk, we call the vector 
(v2W3 — 03W2)i — (v1w3 — V3W1)j + (v1w2 — v2w1)k 


the cross product of v and w, and we denote it as v x w. Fortunately, we have an easy way to remember this 
expression for the cross product. First, we notice that the expressions in parentheses above are determinants of 
2 x 2 matrices, so we can write v X W as 


02 U3 
W2 W3 


vi 02 
Wi w2 


vi U3 
W, W3 


Vxwe i- k. 


j+ 


The expression on the right side has the same form as one of our expressions for the determinant of a 3 x 3 matrix. 
So, we can borrow the notation of the determinant of a 3 x 3 matrix to remember the expression for the cross 
product of two vectors: 


U U2 


k=/0, vo U3]. 
Ty Os 1 U2 03 


Wi, W2 W3 


Wi, W3 


The vector given by v x w is not the only vector orthogonal to both v and w. Because (kp) -r = k(p - r) for 
any scalar k and any vectors p and r, we can multiply our expression for v x w above by any scalar and produce 
another vector that has a dot product of 0 with both v and w. We choose the particular vector above from among 
all of those that are orthogonal to both v and w because it has many convenient properties that we will explore in 
the next set of problems. 


Problems is 


Problem 12.17: Let v = 4i —j + 3k and w = —i + 2j + 7k. 
(a) Find vx wand w xv. 


(b) How are the two results you found in part (a) related? For any two vectors r and s, does the same 
relationship hold between r x s and s x r? 
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Problem 12.18: Show that for any scalar c and any two vectors u and v, we have u X (cv) = (cu) x v = c(u X v). 


Problem 12.19: Show that for any vectors u, v, and w in three dimensions, we have uX(v+w)=uxXvtuxw 


and (u+v)Xw=uxwt+vXxXw. 


u v1 ce ne 
Problem 12.20: Let u = | u2 | and v = ( 3 be nonzero vectors. In this problem, we show that we have 
U3 U3 $ 
jju x vi] = lull ||vi| sin 8, where @ is the angle between u and v. 
(a) Find |ju x v|| in terms of the components of u and v. 
(b) What trigonometric function of O do you already know how to express in terms of u and v? 


(c) Show that |u x v|| = llull livi] sin 0, where @ is the angle between u and v. 


Problem 12.21: Find i xj, j xk, and k xi. 


Problem 12.22: _ 
(a) Find (6i + 9j — 3k) x (2i + 3j — k). 


(b) Ifr=cs for some scalar c, then must we have r x s = 0? 


(c) Ifrxs=0Oand s + 0, then must we have r = cs for some scalar c? 
Problem 12.23: If u, v, and w are nonzero vectors such that u Xv = u X w, then must we have v = w? 


Problem 12.24: Let A = (1,3,2), B = (—3,5,3), and C = (0, —2, —6). Let P be the plane that contains all three of 
these points. In this problem, we find the equation whose graph is P. 
(a) Find a vector that is normal to P. 


(b) Find an equation whose graph is P. 


Problem 12.25: Let E be (1,0, —2), let F be (—2,3, -1), let G be (4, 2, —3), and let H be the point such that EFGH is 
a parallelogram. Find the area of EFGH. 


Problem 12.17: Letv = 4i — j + 3k and w = -i + 2j + 7k. 
(a) Find vx wand w xv. 


(b) How are the two results you found in part (a) related? For any two vectors r and s, does the same 
relationship hold between r x s and s x r? 


Solution for Problem 12.17: 


(a) We have 
i j k 
= A e Iae ; : 
vxw=/4 -1 Jen di- | itl 3 |k =-13i-31j +7k, 
=l 2 7 
i j k 
2 7 ee le 2 : 
wXv= H a 7 =|" Ji- alit] 4 Z [k= sita -7k 


(b) In part (a), we see that v x w = —w Xv. Let’s see why this relationship holds for any two vectors. Swapping 


I a 
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two rows of a matrix multiplies the determinant of the matrix by —1, so we have 


i j k i j k 
rxs=j|ń n f3|=—-|S1 S2 S3|=-sXr. 
5S1 52 53 mM n n3 


Important: For any two vectors v and w in three dimensions, we have 


VXW=-WXV. 


WARNING!! The cross product is not commutative. If v x w is nonzero, then it does not- 
å equal w x v. 


The relationship v x w = —w x v has a geometric significance that makes it easy to visualize. We defined 
v x w by looking for a vector that is orthogonal to both v and w. In Problem 12.16, we saw that such a vector is 
orthogonal to any linear combination of v and w. That is, the vector v x w is normal to the plane that is generated 
by v and w. 


However, there are two directions that are normal to this plane, one in each VX W ye 
direction away from the plane. The direction of v x w satisfies what we call the i 
right-hand rule. To find the direction of v x w, start by pointing the index finger 
of your right hand in the direction of v and the middle finger of your right hand 
along w in such a way that the middle finger is between your index finger and your 
palm, as shown at right. Then, extend your thumb “up” perpendicular to the plane 
generated by your index and middle fingers. This is the direction of v x w, as shown. 


WARNING!! 


NI The right-hand rule only works if you are using your right hand, not your 
a 


left hand. Hence, we have the name “right-hand rule!” 


The vector —Vv x w is in the exact opposite direction from v xw. We can visualize 
this with the right-hand rule. In finding w x v, you start with your index finger (of 
your right hand!) fully extended along w and your middle finger along v such that 
your middle finger is between your index finger and your palm. Then, extend your 
thumb perpendicular to the plane generated by these two fingers, and your thumb 
will be pointing in exactly the opposite direction from the way it pointed when we 
found v X w, as shown. 


Solution for Problem 12.18: Our expression of the cross product as the determinant of a 3 x 3 matrix makes this 
pretty straightforward: 


i j k i j k i j k 
Cu, CU? CU3| =cCļ|u1] U2 U3) = |ui un u3 |, 
U1 v2 U3 01 v 03 CVU, CU2 C3 
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so 
(cu) x v = c(u x v) = u x (cv). 


Problem 12.19: Show that u x (v + w) =uxv+uxwand(u+V)XW=uXxXW+VXW. 


Solution for Problem 12.19: Again, the properties we proved about determinants of 3 x 3 matrices save the day. We 
have 
i j k i j k i j k 
uUxv+uxw=]|ui uo us| +| u u3|=| u ur uz |=ux(v+w), 
01 U2 U3 wi Wo W3 Vi tw V tW V3 +t W3 
where we used the principle we proved in Problem 11.21 to combine the sum of determinants of matrices with 
two rows in common into one determinant. 


Combining this result with the fact that a x b = —b x a for all a and b, we have 


(ut+v)Xw=-w*X(ut+v)=—-wxXu-WXv=uxXwivxw. 


Combining Problems 12.18 and 12.19, we see that for any u, both f(v) = ux v and f(v) = v x ware linear, so the 
cross product is bilinear, just like the dot product. (However, as a reminder, the cross product is not commutative, 
but the dot product is.) _ 


Problem 12.20: Show that if 8 is the angle between nonzero vectors u and v, then |u x vj] = {lull |Ivllsin 8. 


Solution for Problem 12.20: We start by writing an expression for our target, |u x v|]: 


ane == 
uy u2 
IU U2 


2 


ung U3 
U2 V3 


y/ (u203 — U302)? + (U103 — U301)? + (u102 — u201)? 
n oa a o A 
= y U50 + u303 + UPU + UZU? + UTUS + UZU? — 2u2U30203 — ZU1V1U3V3 — 2UyU2V10>. 


That’s quite a mess. However, looking at the other side of the desired equation, ||ul| ||v||sin@, we have some 
hope, since terms like užu will appear in the product ||ull ||v||. We don’t immediately know how to express sin 8 
in terms of u and v, but we do know how to deal with cos 0, since u- v = |lull||v||cos 8. Combining this with 
sin? 0 = 1 — cos? 0, we can express |lul| ||v||sin @ in terms of the components of u and v. We start by finding 
(llull ||v|| sin 6), since this gets rid of the annoying square root signs: 


lul? Iiv? sin? 8 = Ijul? Iivi? (1 — cos? 0) 
AÀ 
= jul? lvi (1 n A | 
Irall? Iivi 
= |lul? IIvi? — (u - v)? 


= (uf + u$ + ub)(vt + v3 + 03) — (U101 + U02 + 303). 


uy U3 


lu x vil 5 
1 03 


The terms u?07, 1305, and u3v3 appear in both the expansion of the product and of the square, so these cancel when 
we subtract, and we're left with 
2 Igli? a; 
lul? livi? sin? 6 = u50} + u3v3 + utv5 + U30? + 205 + UZU — 2u2U30203 — 2U1V1U3U3 — 2u, 20109, 


which is exactly the square of the expression we found for ||u x vl, so Iju x v|? = llul? lvl? sin? 0. Since we 
have 0 < O < 7, we know that |[ul||[v||sin 6 is nonnegative, so taking the square root of both sides gives us 
llu x vl] = |[ul] [Iv] sin @. 5 
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Important: For any nonzero vectors u and v in three dimensions, we have 
y llu x vi] = [ull livi} sin 6, 


where 0 is the angle between u and v. 


(Problem 12.21: Findixj,jxk,andk xi. i 


Solution for Problem 12.21: We could use the determinant formula for the cross product z 
to compute the results, but we can also find them with a little geometric intuition. 
In each case, both vectors have magnitude 1, and the angle between the vectors is 5, 
so the relationship ||v x w|| = |lvi||lwl|sin@ tells us that the magnitude of each cross 
product is 1, since sin 5 = 1. 


All we have left is to determine the direction of each cross product. Each of i, 
j, and k is orthogonal to the other two. Combining this with the fact that i x j has 
magnitude 1, we know that i Xj is either k or —k. Applying the right-hand rule reveals 
that i x j = k. Similarly, we find j x k =iandk xi =j. 


We can visualize these relationships with the diagram at right. For any cross product of the two 
vectors, if we go in the direction of the arrows (which is in the direction of alphabetical order starting 1 e 
with i) to get from the first vector in the cross product directly to the second, then the cross product 
equals the third vector. If we must go in the opposite direction of the arrows, such as k x j, then the k J 
result is the negative of the third vector, so k x j = —i. O 


Problem 12.22: 
(a) Find (6i + 9j — 3k) x (2i + 3j — k). 


(b) Ifr=cs for some scalar c, then must we have r x s = 0? 


(c) Ifrxs=Oands #0, then must we have r = cs for some scalar c? 


Solution for Problem 12.22: 


(a) We have 
i j k 
uxv=|6 9 -3| =0Q0i+0j+0k=0. 
2 ð =Í 
We also could have seen that this determinant equals 0 by noting that the second row is a constant times 
the third. 
(b) Our final observation in part (a) makes it clear that the answer to this part is yes. Letting s = s1i + s2j + s3k, 
we have 
i j k 
rxs= (cs)Xs= |cS1 CS2 cs3|= 0, 
Si S2 $3 


since the second row is a constant times the third. We can also see that r x s = 0 if r = cs by noting that the 
angle 0 between nonzero vectors s and cs is 0 if c > 0 and 7 ifc < 0. In either case, we have sin 0 = 0, so 


llr x sļ| = [IrII [Is|| sin @ = 0. 


Therefore, we must have r x s = 0. 


We also could have noted that (cs) x s = c(s x s). Since a x b = —b x a for any a and b, letting a = b = s 
gives s X s = -=s Xs,sos Xs = 0. Therefore, we have (cs) x s = c(s x s) = 
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(c) Since |r x s|| = |{r|| |Is|| sin @, where 0 is the angle between r and s, we must have r = 0 or sin 0 = 0 ifrxs=0 
ands + 0. Ifr = 0, then r = Os. If sin@ = 0, then 0 = O or 0 = x. If 0 = 0, then r and s are in the same 
direction, so r = cs for some positive c. If 0 = n, then rand s are in opposite directions, so r = cs for some 
negative c. Therefore, in all possible cases, there is a scalar c such that r = cs. 


Important: We haver Xs = 0 if and only if r and s are linearly dependent. 


‘Problem 12.23: If u, v, and w are nonzero vectors such that u x v = u x w, then must we have v = w? 


Solution for Problem 12.23: We subtract u x w from both sides and we have 
uxv-uxw=0. 
Applying the properties we proved in Problems 12.18 and 12.19, we have 
uX(v—w) =0. 


Since there are vectors besides 0 whose cross product with u is 0 (namely, ku for any scalar k), we cannot conclude 
that v — w = 0 from the fact that u x (v — w) = 0. Therefore, we cannot conclude that v = w. O 


WARNING!! If nonzero vectors u, v, and w satisfy uxv=ux w, then we cannot conclude l 
29 that v and w are equal. 


Problem 12.24: Let A = (1,3, 2), B = (-3,5,3), and C = (0, —2, —6). Find an equation whose graph is the plane 
that passes through all three points. 


Solution for Problem 12.24: We know how to find an equation of a plane given a point in the plane and a vector 
normal to the plane, so all we have to do is find a vector normal to the plane through A, B, and C. We've just 
learned a tool for finding a vector normal to two other vectors: the cross product. So, to find a vector normal to 


the plane, we find AB x AC. We have 


AB = B - A = (-3-1)i+ (5 —3)j + (3 —2)k = —4i + 2j +k, 
AC = C-A =(0-1)i + (-2-3)j + (-6 - 2)k = -i - 5j — 8k. 
Therefore, we have 
ee e 
ABxAC=|-4 2 1] =-11i-33j + 22k, 
=|] 255.8 


so —11i — 33j + 22k is normal to the plane through A, B, and C. Factoring out 11, we can write this vector as 
11(-i-3j + 2k), so —i -3j + 2k is normal to the plane, and the desired equation is —x —3y + 2z = d for some constant 
d. Point A is on the graph of this equation, so we substitute (1,3, 2) into the equation to find d = —6. Therefore, the 
graph of —x — 3y + 2z = —6 is the plane that passes through A, B, and C. 0 


Problem 12.25: Let E be (1,0,—2), let F be (—2,3,—1), let G be (4, 2, —3), and let H be the point such that EFGH 
is a parallelogram. Find the area of EFGH. j 
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Solution for Problem 12.25: The area of a parallelogram equals the length of a side times the H G 


distance between that side and the opposite side. We can easily find the side lengths of the 
parallelogram, but finding the distance between opposite sides is cumbersome. Instead, we 


note that drawing a diagonal of the parallelogram divides it into two congruent triangles, E Ẹ 
as shown at right. So, the area of EFGH is twice the area of AEFG. But how will we find [EFG]? We turn to 
trigonometry, and we have 


[EFGH] = 2[EFG] = 2 (SPIE) sin LEEG) = (EF)(FG) sin ZEFG. 


This expression looks familiar! Back in Problem 12.20, we found that ||u x vl| = ||ul| livi] sin 0, where 9 is the angle 
between u and v. Applying this here, we have ||FE x FG|| = (EF)(FG) sin ZEFG, so the area of A equals ||FE x FG||. 
Since FE = (1—-(-2))i+(0-3)j + ((-2)-(-1))k = 3i-3j —k and FG = (4—(—2))i+(2-3)j +((-3)-(-1))k = 61-j—2k, 


we have 
i j k 
det| 3 -3 -1 
6 -1 -2 


In the process of solving Problem 12.25, we found another relationship between geometry and the cross 
product. 


Area of A = \|FE x FG| = 


= ||5i + Oj + 15k|| = V52 +152 = 5 V10. 


o 


Important: The area of a parallelogram with v and w as sides is ||v x wll. 


|| Exercises E 


3 ži 4 
Peal berate hbal 7 racs —1 |. 
-1 0 3 


(a) Findaxb. 

(b) Findbxec. 

(c) Findcxa. 
12.3.2 Find an equation whose graph is a plane through (—1, 3, 4), (2, 1, -5), and (-7, -1, 0). 
12.3.3 Find nonzero vectors u, v, and w such that u x v = u x w, but v # w. 
12.3.4 Find the area of a triangle with vertices (—1, 3, 0), (—2,5, 6), and (6,0, —4). 
12.3.5 Show that |ju x v|? = llul}? ilv]? — (u - v)*. This is called the Lagrange Identity. 
12.3.6* 


(a) Show that a x (b x c) = (a - c)b — (a: b)c. 
(b) Show that a x (b x c) + b x (c Xa) + c x (aX b) = 0. This is called the Jacobi Identity. 
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12.4 Geometric Interpretation of the Determinant 


In Section 10.5, we found a geometric interpretation of the determinant of a 2 x 2 matrix. This interpretation de- 
scribed the determinant in terms of area. Moving up to three dimensions, maybe we can describe the determinant 
of a3 x 3 matrix in terms of volume. 


In two dimensions, we linked the determinant to the area of a parallelogram. The 
three-dimensional analogue of a parallelogram is a parallelepiped, which is a prism 
in which all six faces are parallelograms. An example is shown at right. 


[l Probiems ig 


Problem 12.26: Let O be the origin, U be (u1, u2, u3), V be (vı, 02,03), and W be (w1, w2, w3). Also, let u = uA 
v = V,and w = W. In this problem, we show that the volume of the parallelepiped with OU, OV, and OW as 
edges is |u - (v x w)l. 

(a) Let P be the parallelogram with sides OV and OW. Find the area of P in terms of v and w. 

(b) Find the distance from U to the plane of P in terms of u, v, and w. 


(c) Show that the volume of the parallelepiped with OU, OV, and OW as edges is |u - (v x w)I. 


Problem 12.27: Let u = mi + uj + ugk, v = vii + vj + vsk, and w = wii + Woj + W3k. Express u-(v x w) as the 
determinant of a 3 x 3 matrix whose entries are components of u, v, and w. 


Problem 12.28: Find the volume of the tetrahedron with vertices (3, 4, —1), (—2,3,5), (4, —1, 0), and (—5, 5, 2). 


3 -1 4 
Problem 12.29: Let A = | -1 1 -2 }. 
-5 2 -7 


(a) Find det(A). 


(b) Why does your answer to part (a) tell you that the vectors 


—5 
| and ( 2 ) are not linearly 
-7 


independent? 


Problem 12.30: Let M be a 3 x 3 matrix. 
(a) Show that det M = 0 if and only if the rows of M are linearly dependent. 


(b) Show that det M = 0 if and only if the columns of M are linearly dependent. 


We found in Section 10.5 that if two given vectors from the same point represent the sides of a parallelogram, 
then the signed area of the parallelogram equals the determinant of the matrix with these two vectors as columns 
(or rows). A reasonable guess in three dimensions is that the determinant corresponds to the volume of a 
parallelepiped. So, let’s see if we can find an expression for the volume of a parallelepiped in terms of three 
vectors that represent edges from the same vertex of the parallelepiped. 


Ü, E 


OW as edges is |u - (v x w)|. 


Problem 12.26: Let O be the origin, U be (u1, u2, u3), V be (v1, 02,03), and W be (wi, w2, w3). Also, let u = 
v = V, and w = W. Show that the volume of the parallelepiped with OU, OV, and 
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Solution for Problem 12.26: The volume of a parallelepiped 

equals the area of a face times the distance between that 

face and the opposite face. Each face of a parallelepiped : 

is a parallelogram, and we know how to find the area of Prolvxw" 
a parallelogram that has two vectors as consecutive sides. 
We let Y be the point such that OVYW is a parallelogram, 
and let 0 = ZVOW. Then, 


O 


[OVYW] = (OV)\(OW)sin@ = Iiv x wll. 


To find the distance between OVYW and the opposite face, we find the distance must U to OVYW. We 


discovered how to do this in Problem 12.13; this distance is the magnitude of the projection of U onto a vector 
normal to the plane of OVYW. We know how to find such a normal vector: v x w is normal to the plane generated 


by v and w. So, since U = u, the desired height is proj,,.,,u, as shown in the diagram. This gives us 


Volume = |[projy.,tl| [OVYW] = || A a w)||llv x wil 
llv x wll 
lu- vxw 
llv x wll 
= |u : (v x w)]| 


O 


We can use signed volume to get rid of the absolute value sign in |u : (v x w)|, similar to how we used signed 
area in two dimensions. Specifically, we say the volume of the parallelepiped with edges u, v and w is positive 
if the vector given by v x w makes an acute angle with u, and the volume is negative if v x w makes an obtuse 
angle with u. Another way to think about this is with the right-hand rule. If u is on the same side of the plane 
generated by v and w as indicated by the right-hand rule applied to v x w, then the volume is positive. If u is on 
the opposite side, then the volume is negative. 


Let O be the origin, U be (u1, u2, u3), V be (v1, v2, v3), and W be (w1, w2, w3). Also, 
let u = Ü, v =; and w = W. The signed volume of the parallelepiped with 
OU, OV, and OW as edges is u - (v x w). 


Important: 


We call u - (v x w) a box product or scalar triple product of u, v, and w. 


Now, we're ready to relate the determinant of a 3 x 3 matrix to volume. 


Problem 12.27: Letu = mi + uj + usk, v = vii + vj + vsk, and w = wii + w2j + w3k. Express u- (v x w) as the 


determinant of a 3 x 3 matrix whose entries are components of u, v, and w. 


Solution for Problem 12.27: We have 
i j k 


7 A r : U2 V3 v «U3 Vy V2 
. = (umi + uj +u3k): |v v2 v3|=(mi+u + uk): ( k 
u- (v xX w) = (u 2j +uzk): |v v2 03) = (u 2j + u3k) w w wi w wi wz 
wi W2 W3 
v v v v v v 
Si 2 3 ae 1 I a [ 2 
W2 W3 w1 W, W2 
Uy, Uo Wy 
= |V UV 03 
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Therefore, we can express the volume of a parallelepiped as a box product, and we can express the box product 
as a determinant. Putting these together, we have: 


Let O be the origin, U be (u1, u2, u3), V be (v1, v2, v3), and W be (w1, w2, w3). The 
determinant 


Important: — 


Y 


u u ug| 
Vy U2 U3 
W, W2? W3 


equals the signed volume of the parallelepiped with OU, OV, and OW as edges. f 


We noted on page 367 that if M is a 2x2 matrix and v and w are vectors in 2 dimensions, then the parallelogram 
with Mv and Mw as sides has signed area det(M) times the signed area of the parallelogram with v and w. 
Analogously, if N is a 3 x 3 matrix and r, s, and t are vectors in 3 dimensions, then the parallelepiped with edges 
Nr, Ns, and Nt has signed volume det(N) times the signed volume of the parallelepiped with edges r, s, and t. 


This observation gives us an intuitive understanding for the following: 


B y © det(AB) = det(A) det(B). 


Specifically, if we multiply each of r, s, and t by AB, we get the same three vectors as if we multiply each first by B 
and then multiply the resulting products by A. So, scaling the signed volume in one step by det(AB) gives the same 
signed volume as first scaling by det(B) and then scaling the result by det(A). Therefore, det(AB) = det(A) det(B). 
(Note that this isn’t a proof; it is an intuitive explanation meant to make you have a better feel for the result. 
Proving that det(AB) = det(A) det(B) requires some more serious machinery, or a lot of patience with algebra.) 


Problem 12.28: Find the volume of the tetrahedron with vertices (3, 4, =1), (—2,3,5), (4, —1,0), and (—5, 5, 2). 


Solution for Problem 12.28: First, we give the points the labels O, A, B, C, respec- 
tively. We know how to find the volume of a parallelepiped, so we try to relate the 
tetrahedron to a parallelepiped we can handle. Specifically, we know how to handle 
the parallelepiped with OA, OB, and OC as edges, so we try to relate the desired 
tetrahedron to this parallelepiped. 


Let D be the point such that AOBD is a parallelogram, let 7 be tetrahedron OABC, 
let h be the height from C to AOBD, and let P be the parallelepiped with OA, OB, and 
OC as edges. Then, we have 


M[AOB] _ h({AOBD]/2) _ MAOBD] _ Volume(?) 


Volume(7 ) = 3 3 - n 


By Problem 12.27, the volume of P equals the absolute value of the determinant of the matrix with OA, OB, 
and OC as rows. Therefore, we have 


1 (-2)-3 3-4 5-(-1) 1 -5 -1 6 143 
Volume(7) = = |det{| 4-3 (-1)-4 O-(-1) ]|/=<=Idet| 1 -5 1 }/=—. 
(25)=3 eaa 2 ays eel 6 
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3 -1 4 
Problem 12.29: Let A= | -1 1 -2 |. 
- 2 -7 


(a) Find det(A). 


= 
(b) Why does your answer to part (a) tell you that the vectors ( ,and ( 2 ) are not linearly 
=y 


independent? 


Solution for Problem 12.29: 
(a) We have 


det(A) = (3)(1)(-7) + (-1)(-2)(-5) + (4)(-1)(2) — (3)(-2)(2) — (-1)(-1)(-7) - (4(1)(-5) 
es (1?) (7) (20) = 0. 


(b) By Problem 12.27, the determinant equals the signed volume of the parallelepiped with edges represented 


3 -1 —5 
by the vectors u = | -1 |, v= 1 J], ,andw= | 2 |. Since the determinant is 0, we know that the 
4 —2 -7 


volume of this parallelepiped is 0. That is, the distance from the head of u to the plane generated by v and 
w is 0. This means that u is in the plane generated by v and w, so u is a linear combination of v and w. 


Oo 


| Problem 12.30: Let M be a 3 x 3 matrix. 
(a) Show that det M = 0 if and only if the rows of M are linearly dependent. 
(b) Show that det M = 0 if and only if the columns of M are linearly dependent. 


Solution for Problem 12.30: 


(a) We have already done all the heavy lifting for this problem. Suppose U, V and W are points in space, and 
let M be the matrix whose rows are U, V, and W. Let P be the parallelepiped with edges OU, OV, and OW. 
As usual, we let O be the origin. 
The volume of P is 0 if and only if U, V, and W are in the same plane, which is true if and only if these 
vectors are linearly dependent. Since det(M) equals the signed volume of P, we therefore have det(M) if 
and only if Ü, V, and W are linearly dependent. 


f a 3x3 matrix is 0 if and only if its rows are linearly | 


Important: The determinant o 


Y dependent. 


(b) In Problem 11.16, we showed that det(M) = det(M!). From part (a), det(M‘) = 0 if and only if the rows of 
M’ are linearly dependent. The rows of M” are the columns of M! So, we have det(M) = 0 if and only if 
the columns of M are linearly dependent. 


Important: The determinant of a 3 x 3 matrix is 0 if and only if its columns are linearly 


Vv dependent. 
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Combining parts (a) and (b) of this solution, we see that: 


parent The rows of a 3 x 3 matrix are linearly dependent if and only if the columns of 
Y the matrix are linearly dependent. 


| 


12.4.1 The point (—2, 1,5) is one vertex of a parallelepiped, and it is connected by edges of the parallelepiped to 
(4,-1,-2), (-1,2,-1), and (4, -3,0). What is the volume of the parallelepiped? 


12.4.2 Show thata- (b xc) =(axb)-c. 


12.4.3 Suppose that each two rows of matrix M are orthogonal. Show that the product of the magnitudes of the 
rows equals | det(M)|. 


12.5 Inverse of a3 x3 Matrix 


As with 2 x 2 matrices, an inverse of a 3 x 3 matrix A is a matrix B such that BA = I. We denote the inverse of A 
as A™. In Section 10.6, we showed that a 2 x 2 matrix has an inverse if and only if the determinant of the matrix 
is nonzero. In this section, we prove the same condition for the existence of an inverse of a 3 x 3 matrix. 


i Problems > 


Problem 12.31: As a reminder, we say that function f that maps vectors to vectors is a linear function if 
(i) f(av) = af(v) for all scalars a and vectors v, and 
(ii) f(v + w) = f(v) + f(w) for all vectors v and w. 
Show that if g is a linear function that maps vectors to vectors (in three dimensions), then there is a matrix A 


such that g(w) = Aw for all w. Is this matrix unique for each function g? 


Problem 12.32: 
(a) Show that if det(A) + 0, then there exists a function f such that for every c, we have A(f(c)) = c. 


(b) Show that the function you found in part (a) is linear. 


Problem 12.33: 
(a) Show that if det(A) # 0, then there is a matrix B such that BA = I. 


(b) Is the matrix B you found in part (a) unique? 
(c) Do we also have AB = I for the matrix B you found in part (a)? 


Problem 12.34: In this problem, we show that if det(A) = 0, then A does not have an inverse. 
(a) Show that if det(A) = 0, then there is some vector c such that there is no solution v to Av = c. 


(b) Use part (a) to show that A does not have an inverse if det(A) = 0. 


12.5. INVERSE OF A 3 x 3 MATRIX 


Problem 12.31: As a reminder, we say that function f that maps vectors to vectors is a linear function if 


(i) f(av) = af(v) for all scalars a and vectors v, and 


(ii) f(v + w) = f(v) + f(w) for all vectors v and w. 


Show that if g is a linear function that maps vectors to vectors (in three dimensions), then there is a matrix A 
such that g(w) = Aw for all w. Is this matrix unique for each function g? 


Solution for Problem 12.31: Back on page 354, we proved the equivalent statement in two dimensions. We use the 
same approach here. 


We can write any vector v as vi + v2j + vsk. Applying item (ii) in our definition of a linear function gives 
gi + vj + vsk) = g(v1i) + g(v2j) + gvk). 
Then, applying item (i) to each term on the right, we have 
gmi + vj + v3k) = g(v1i) + g(v2j) + gvk) = 01.9(i) + v2g(j) + v3g(k). 


Since g maps vectors to vectors, we have 


411 412 413 
gi) = | aa J,  gj)=|azn |,  gk)= |as}, 
431 432 433 


for some scalars 411,412, . . . ,a33. Therefore, we have 


Vi 411 412 413 V1411 + V2412 + V3413 441 a12 413 V1 
E U2 =U, | doi | +02 | 42 | +03 | 423 | = | V121 + 02422 + V3423 | = | 421 422 a23 v j. 
U3 431 432 433 01431 + V2432 + V3433 431 432 433 U3 


This tells us that there exists a matrix A such that g(v) = Av for all vectors v. Moreover, the matrix is unique, since 
its columns, in order, must be @(i), g(j), and g(k). 0 


Important: Any linear function g that maps vectors in R? to vectors in R? can be expressed 


as multiplication by a unique 3 x 3 matrix, where the columns of the matrix are _ 


g(i), g(j), and g(k). 


Problem 12.32: 
(a) Show that if det(A) + 0, then there exists a function f such that for every c, we have A(f(c)) = c. 


(b) Show that the function you found in part (a) is linear. 


Solution for Problem 12.32: 


(a) Weshowed on page 397 that if det(A) # 0, then the equation Av = c has a unique solution v for each vector 
c. Since for each c there is a unique v such that Av = c, we can define a function f(c) to return the vector v 
such that Av = c. 


(b) We must show the following: 
(i) f(ke) = kf(c) for all scalars k and vectors c, and 
(ii) f(b +c) = f(b) + f(c) for all vectors b and c. 
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We tackle item (i) first. Let f(c) = v, so Av = c. Multiplying both sides by k gives kAv = ke. Since 
kAv = A(kv), we have A(kv) = ke. Therefore, we have f(kc) = kv = kf(c). 


Turning to item (ii), let f(b) = u, so Au = b. Adding Av = c to this gives Au + Av = b +c, so 
A(u + v) = b + c. Therefore, we have f(b +c) = u + v = f(b) + f(c). 


Since we have showed that f satisfies conditions (i) and (ii) above, we conclude that f is linear. 


Problem 12.33: 
| (a) Show that if det(A) + 0, then there is a matrix B such that BA = I. 


(b) Is the matrix B you found in part (a) unique? 


(c) Do wealso have AB = I for the matrix B you found in part (a)? 


Solution for Problem 12.33: 


(a) We did all the heavy lifting in Problem 12.32. There, we showed that if det(A) # 0, then there is a linear 
function f such that, for every c, the vector f(c) is the vector v such that Av = c. Because the function is 
linear, there exists a matrix B such that f(c) = Bc. Therefore, if Av = c, then Bc = v. Multiplying both sides 
of Av = con the left by B, we have 

BAv = Bc=v 
for all v. Therefore, we must have BA = I. 

(b) The linear function we found in Problem 12.32 is unique for each A, since it is determined by the unique 

solutions to Av = i, Av = j, and Av = k. In Problem 12.31, we showed that a linear function that maps 


vectors to vectors can be expressed uniquely as multiplication by a matrix. Therefore, the matrix B such 
that BA = I is unique. 


(c) As explained in part (b), we have Bc = vif Av = c. Multiplying both sides of Bc = v on the left by A, we 
have 
ABc = Av=c 


for any vector c, so AB = I. In other words, if B is the inverse of A, then A is the inverse of B. 


Important: ~ Tf det(A) + 0, then there is a unique matrix A~!, called the inverse of A, such. 


V that AA = AA7! =I. 


As with 2 x 2 matrices, we say that a matrix is invertible if it has an inverse. We found that a 2 x 2 matrix is 
not invertible if its determinant is 0. Let’s see if the same is true of a 3 x 3 matrix. 


Problem 12.34: In this problem, we show that if det(A) = 0, then A does not have an inverse. 
_ (a) Show that if det(A) = 0, then there is some vector c such that there is no solution v to Av = c. 


(b) Use part (a) to show that A does not have an inverse if det(A) = 0. 


Solution for Problem 12.34: 


(a) Let the columns of A be aj, az, and a3. Since det(A) = 0, the columns of A are linearly dependent. So, as 
explained in Problem 12.14, the graph of 


x 
(3) = Via, + U2a2 + V383, 
Z 


SBC E a 
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where v, V2, V3 are parameters, is not all of R?. In other words, there is some vector c such that there are no 
values of vı, V2, v3 for which c = via; + V2a2 + V3a3. Since 


Vijay + U2a2 + V3a3 = Vv, Ai + U2 Aj SP v3Ak = A(vıi + VJ oF v3k), 


there is no vector v for which c = Av. 


(b) If A! exists, then we can multiply both sides of Av = c on the left to get AAv = A`™!c, so v = Ale. In 
other words, if A is invertible, then for any c, the vector v = A7'cis a solution to Av = c. But we showed in 
part (a) that if det(A) = 0, then there is some vector c such that there is no solution v to the equation Av = c. 
So, we conclude that A is not invertible if det(A) = 0. 


Important: A 3 x 3 matrix has an inverse if and only if the determinant of the matrix is 


nonzero. 


As a reminder, we say that a 3 x 3 matrix is singular if its determinant is 0 and nonsingular if its determinant 
is nonzero. So, a 3 X 3 matrix has an inverse if and only if it is nonsingular. 


|| Exercises BE 


12.5.1 If A is nonsingular, then how are det(A) and det(A“) related? 


1 5 5 
12.5.2 Find all x such that | -2 6 x | does not have an inverse. 
x -2 1 


12.5.3 Must the sum of two singular matrices be singular? 


12.5.4 A 3 x 3 matrix is an upper triangular matrix if all the entries below the main diagonal are 0. That is, it is 


a matrix of the form 
11 A2 M3 
0 an 43 
0 0 433 


(a) Show that if all the entries on the main diagonal of an upper triangular matrix (411, 422, and a33 above) are 
nonzero, then the matrix is invertible. 


(b) Show that if an upper triangular matrix is invertible, then its inverse is also an upper triangular matrix. 


12.6 Summary 


The vectors v1, V2,- . ., Vn are linearly dependent if there exist constants 41, 47,...,4n, not all 0, such that we have 
41V1 + 42V2 + +- + An Vn = 0. Otherwise, we say the vectors are linearly independent. 


x Xo 
The graph of | y | = | Yo | + tv isa line through (xo, Yo, Zo) with direction vector v. 
Z Zo 
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If v and w are linearly independent, then the graph of 


x Xo 
y | =| Yo | +sv+tw 
Z Zo 


is the plane through (xo, yo, zo) generated by v and w. Any two linearly independent vectors in a plane can be 
used to generate the plane. A vector is normal to a plane if it is orthogonal to every vector in the plane. The graph 
of any equation of the form ax + by + cz = d, where a, b, and c are not all 0, is a plane. Every plane is the graph of 
an equation of this form, and the vector ai + bj + ck is normal to the graph. 


x 
If u, v, and w are linearly independent, then the graph of (3) = ru + sv + tw is all of R°. 
Z 
We define the cross product of two vectors as 
v v v v v v JTK 
VxWeH= i is i- - on os ne k= Ui V2 U3). 
2 W3 1 W3 1 W2 WW, Ws 


The cross product v x w is orthogonal to both v and w, and has the following properties: 


e v X w satisfies the “right-hand rule,” which is illustrated at right 
e VXW=-WXV 
e u X (cv) = (cu) Xv = c(u x v) 


e ux(v+w)=uxv+uxwand(u+v)xw=uxw+vxw 


lla x vl] = llull livi] sin 8 


e r Xs = 0 if and only if r and s are linearly dependent 


The signed volume of the parallelepiped with u, v, and w as edges is u - (v x w), where u : (v x w) is called a 
box product or scalar triple product. We can compute such a product with a determinant: 
Uy u2 W3 


u-(VvXW)=|% v 03). 
wi Wr W3 


The determinant of a 3 x 3 matrix also has the following properties: 


e If A and B are 3 x 3 matrices, then det(AB) = det(A) det(B). 
e The determinant of a3 x 3 matrix is 0 if and only if its rows are linearly dependent. 


e The determinant of a 3 x 3 matrix is 0 if and only if its columns are linearly dependent. 


If det(A) + 0, then there is a unique matrix A™', called the inverse of A, such that ATA = AA =, K 
det(A) = 0, then A does not have an inverse. 
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| REVIEW PROBLEMS > 
1 —3 x 
12.35 Letv= (2m (4) map (). 
—5 —2 Z 


(a) What is the shape of the graph of all points (x, y, z) such that p : v = 4? 


(b) What is the shape of the graph of all points (x, y,z) such that p- v = p- w? 
(c) What is the shape of the graph of all points (x, y, z) such that p : v = 4 and p: w = 4? 


12.36 Find a and b such that (a, 2, b) is on the line through (2, 4,2) and (1,6, —1). 
12.37 Find a and b if the graphs of 3x — 2y + 8z = 5 and ax + by — 4z = 7 are parallel planes. 
12.38 Let £ be the line through (5,4, 1) and (11, 1, —8) and m be the line through (5, —1, 2) and (0, 14, 7). 


(a) Show that £ and m intersect, and find the point at which they intersect. 
(b) Find an equation whose graph is a plane that contains both f and m. 


12.39 Let (x1, y1,21), (X2, Yz, Z2), and (x3, y3, 23) be three noncollinear points. Show that the graph of 


X= Xi y— yi Z= Z] 
ee Yy2-7Yı1 Zz -2ı|=0 
x3 Xi %Y3-Yi 243721 
is the plane through these three points. 
4 -1 
1240 Letv= | -1 |] andiw= | 2 |. 
2 3 
9 
(a) Isp = | —4 | alinear combination of v and w? If so, find all possible ordered pairs of scalars (a, b) such 
1 


that p = av + bw. 


(b) Suppose ris a linear combination of v and w. Show that there is only one ordered pair of scalars (a, b) such 
that r = av + bw. 


(a) Ifu-w=v-w forall vectors w, then must u = v? 


(b) Ifuxw =v xw for all vectors w, then must u = v? 
12.42 Find (3i + 2j — k) x (-3i + 4j + 2k). 
12.43 Ifi x v =k, then must v = j? 


12.44 Find an equation of the form ax + by + cz = d whose graph is the plane through the origin generated by 
4 =] 
-1 | and | 0 }. 
2 4 


n IO 
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12.45 Find an equation whose graph is the plane that contains both (—2,3,5) and the graph of the parametric 
equations x = 3-2t,y=4+1t,z = —5t. 


12.46 Ifaxb=candb x c= a, then must c x a = b? 
12.47 


(a) Letv = 4i -j + 2k. Find a matrix M such that for any vector w, we have Mw = v x w. 


(b) Show that for any nonzero vector u, there is a matrix N such that Nw = u x w for any vector w. 
12.48 Show that if u -v = 0 and u x v = 0, then either u = 0 or v = 0. 


12.49 Use the geometric interpretation of the determinant to explain why 


ui U2 U3 Uui tU U +U: U3 +t 03 
01 U2 ws U4 U2 U3 
Wy W2 W3 w w2 w3 


12.50 Find the inverse of each of the following: 


oa o 0 0 1 6 0 s 
(a) 10 0 © {100 (c) 1o © 
woi 010 01 0 


12.51 Suppose that A and B both have inverses. Find the inverse of AB in terms of A~ and B”. 
12.52 If P is the product of an invertible matrix and a non-invertible matrix, then does P have an inverse? 


12.53 Suppose A is nonsingular. Then, do we have (A‘)"! = (A7!)7? 


12.54 We can project a vector on a plane just as we can project it onto a line. Suppose we draw the vector v from 
a point O in the plane, where v is not in the plane. If we let V be the head of v, and let F be a point in the plane 


such that FV is normal to the plane, then the vector OF is the projection of the vector onto the plane. 


Challenge Problems _ 


(a) Find a matrix M such that Mv is the projection of v onto the xy-plane (the graph of z = 0) for all vectors v. 
(b)* Find a matrix N such that Nv is the projection of v onto the graph of x + y +z = 0. Hints: 231, 6 


12.55 Letv = 3i — 2j + k. Find nonzero vectors w and u such that each of v, w, and u is orthogonal to the other 
two. 


12.56 Let A;; be the minor of row 7 and column j of matrix A, as described on page 404. Show that if A is 
invertible, then 
f 1 An -An A3 
Ansy ya Am Aa me 
d oe hon Lee 


12.57 Show that if a nonzero vector v makes an angle of a with i, an angle of $ with j, and an angle of y with k, 
then we must have cos? a + cos? $ + cos? y = 1. Hints: 273 
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12.58 Show that if u and v are not parallel, then the vector ||v|| u + |{u|| v bisects the angle between u and v. 
Hints: 163 


12.59 In the text, we developed an expression for the cross product by searching for a vector that is normal to 
two given vectors. In this problem, we find another way to develop the expression for the cross product of two 
vectors. So, for this problem, set aside what you already know about the cross product. Suppose we define the 
cross product such that we have the following: 


(i) ixj=k,jxk=i,andk xi=j 
(ii) a x b = —b Xa for any a and b 
(iii) a x (kb) = k(a x b) for any k, a, and b 
(iv) ax (b+c) =aXb+axc for any a, b, and c 
What we have done here is defined the cross product to have some basic algebraic properties that we'd like 
the cross product to have. In the problems below, you will prove that starting from the properties above, we can 


derive some of the other properties of the cross product given in the text, as well as the expression for v X w in 
terms of the components of v and w. 


(a) Show that v x v = 0 for any vector v. 


U2 V3]. 


(b) Show thatif v = vii + voj + v3k and w = wii + w2j + w3k, then v x w = TE 
2 W3 


(c) Show that v x w is orthogonal to v and w. 


12.60 Show that (a x b) x (a x c) = (a: (b x c))a. Hints: 19 
12.61 Show that (a x b) - (c x d) = (a - c)(b - d) — (a- d)(b - c). Hints: 36 


12.62 Show that if a, b, c are linearly independent, and each two of them are orthogonal, then for any vector v, 


we have 
v-a v-b v.c 
Vi= a7 D4. 
llal| Ilb] lIel| 


12.63 Does there exist a 3 x 2 matrix M such that for every vector v in 3 dimensions, there is a vector u in 2 
dimensions such that Mu = v? Hints: 181, 135, 41 


12.64 Find the shortest distance from (0, 1,2) to a point on the line through (4, 1, —3) and (—2,5, 1). Hints: 59 


12.65 Let A, B, C, and D be points in space such that the volume of parallelepiped with edges AB, AC, and AD 
is 80. Hints: 242, 199 


(a) If AB and AC are edges of parallelepiped P and AD is an interior diagonal of P, then what is the volume 
of P? 
(b) If AB, AC, and AD are face diagonals of Q, then what is the volume of Q? 


12.66 


(a) Prove that for any vector v, there exists a matrix M such that Mw for any w equals the projection of w 
onto v. Hints: 210 

(b) Suppose w is a nonzero vector and M is the matrix such that Mw = proj,w for allw. Find det(M). Hints: 138 

(c) Letv = 3i— 2j + 5k. Find a matrix M such that Mw equals the projection of w onto v for any vector w. 

(d) What is M?, where M is your answer to part (a)? 
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12.67x The distance between two lines in space is the length of the shortest segment that has an endpoint on 
each line. Let vı and vz be direction vectors of two non-parallel lines in space, kı and ky. Show that for any points 
A, on kı and Az on ky, the distance between kı and kz equals 


z 
|4142 : (v1 x V2)| 
llvi x voll 


Hints: 149, 81, 105 


There ain't no rules around here! We're trying to accomplish something! — Thomas Edison 


CHAPTER —_,... al 


Vector Geometry 


13.1 Introduction 


Most of our applications of vectors to geometry start from basic principles that we have already studied. We'll 
start this section with a reminder of these basics, and a reminder of some of the notation we use when applying 
vectors to a geometry problems. All of the following relationships apply both to vectors in two dimensions and 
to vectors in three dimensions. 


We depict the vector from a point O to a point V as OV. When O is the origin, we can express this simply as v. 
We add two vectors using “head-to-tail addition,” an example of which is shown at right. Ẹ 


Point O is the origin, and we find the vector sum V+W by drawing a copy of W starting W 
at point V, thereby producing VP in the diagram. Since VP = W, quadrilateral OVPW is a 


parallelogram, and we have P=V+W. V 
O 
W At left we have a picture of vector subtraction. As explained on page 305, the vector from 
V to Wis VW, where we have 7 we, 
V VW = W-V. 
O Since W and V are vectors from the origin, the relationship VW =W-V gives us a 


convenient way to express any vector in terms of vectors from the origin. 


The last two basic tools we'll use a great deal in this chapter are magnitude and the dot product. The magnitude 


of VW is the length of VW, so we have 
viv] = vw 
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The dot product of two vectors is the product of the magnitudes of the two vectors and the cosine of the angle 
between them. So, if 0 is the angle between vectors V and W, then we have 


= 


0. W =W- Y = ||P] cose. 


Now, let's get to the problems! 


Problem 13.1: Show that for any three points A, B, and C, we have AB — AC = CB. 


Problem 13.2: Show that for any distinct points A, B, C, and D, we have A+C=B+Difand only if ABCD is a 
parallelogram. 


Problem 13.3: 
(a) Let M be the midpoint of AB. Find M in terms of A and B. : 
(b) Let k bea real number, and let T be on AB such that ae = k. Find T in terms of A, B, and k. 


Problem 13.4: 
(a) Geometrically speaking, what does A-A equal? 
(b) Express AB? in terms of A-A,A-B,and È- B. What trigonometric relationship is this known as? 


Problem 13.5: 
(a) Use the dot product to write an equation in terms of A, B, o and D that is true if and only if AB L CD. 
(b) Use vectors to show that each median of an equilateral triangle is also an altitude of the triangle. 


(c) Use vectors to show that if each median of a triangle is also an altitude of the triangle, then the triangle 
is equilateral. 


Problem 13.6: 
(a) Show that if A, B, C, and D are points in space such that AB? + CD? = BC? + AD?, then AC and BD are 
orthogonal. 


(b) Look back at Problem 8.17 on page 274. How is part (a) a more general result than the problem we 
solved in Problem 8.17? What sorts of problems can we consider with vectors that we cannot tackle with 
complex numbers? 


MC i — — — 
| Problem 13.1: Show that for any three points A, B, and C, we have AB — AC = CB. 


Solution for Problem 13.1: We tackle this problem by expressing each vector in terms of vectors from the origin, 
which will allow us to cancel the appearances of A on the left side. But we have to be careful: 


WARNING! Wenhave ADD A AA eens | = 
a 
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SSS sess SSS SSS SSS 


So, we have 


Subtracting the second equation from the first gives 


> > > 


AB — AC = (B- A)-(C- A) = B-C=CB. 


| Problem 13.2: Show that for any distinct points A, B, C, and D, we have A+C=8+Difand only if ABCD is 
| a parallelogram. 


Solution for Problem 13.2: We have an “if and only if” problem, so we really have two B C 
problems. First, we show that if A+C = B + D, then ABCD is a parallelogram. We can 

rearrange the equation as ie aae D so AB = DC. This tells us that AB || DC and that 

AB = DC, so A, B, C, and D are vertices of a parallelogram. We know that this parallelogram A D 


is ABCD, not BACD, because AB = DC tells us that going from A to B is the same direction 
as going from D to C. 


= 


Conversely, if ABCD is a parallelogram, we have DC = AB, soC-D=B-A. Adding A and D to both sides 
givesA+C=B+D. m 

Notice that nothing in our solution to Problem 13.2 assumes that points A, B, C, and D are in the same plane. 
This means our proof above holds in three dimensions as well as in two dimensions. In other words, it shows that 
for any points A, B, C, and D in space, we have A+C=8+Difand only if ABCD is a parallelogram. It follows 
that if A+ Č =B + D, then A,B, C, and Dare saa (meaning they are all in the same plane). 


Problem 13.3: 
(a) Let M be the midpoint of AB. Find M in terms of A and B. 


(b) Letk bea real number, and let T be on AB such that ar =k, Find T in terms of A B, and k. 


Solution for Problem 13.3: 


(a) A natural guess is that M is the average of A and B. To see that this guess is correct, B C 
let O be the origin, and let C be the point such that Č = A+B. Then, OACB is a 
parallelogram. The diagonals of a parallelogram bisect each other, so the midpoint 


of OC is the midpoint of AB. Therefore, Point M is on OC such that OM = SE. This O A 
means that M has the same direction as C but half the magnitude. Therefore, we have 


Important: If M is the midpoint of AB, then M =(A+B)/2. | 


Another way we could have proved this is to note that if M is the midpoint of AB, then AM and BM 
are equal in magnitude, but in opposite directions. We therefore have AM = -BM. Since we also have 
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AM =M-Aand BM = M - B, we must have 

M-A=-(M-B). 
Solving for M gives M = (A +B)/2, as before. 

(b) Our second solution to part (a) gives us a pretty good guide to solving this part. Since T is B 
on AB such that AT/BT = k, we must have TA = K(BT). Notice that we are careful to make DE 


> > > 
sure the two vectors in this equation are pointing in the same direction. Writing TAasA-T A 
and BT as T — B, we have 


Solving for T gives 


A+kB 
TE 
1+k 
One way to think about the relationship among T, A, and B is that T is the weighted b B 
average of A and B. This becomes an even more natural way to think about the relationship ue 
if we let a = AT, and b = BT, so AT/BT = a/b and we have A T 
fz ore = bA + aB 
Lae b+a ~ 


It may seem a bit counterintuitive that we weight A by the length of BT rather than the length of AT. To get 
a sense for why this weighting is correct, note that if T is much closer to A than to B, then BT is much larger 


. . . > 7 ° . . . 
than AT, so oa is close to 1 and z$; is close to 0. This makes T close to A in our expression above, which is 


as expected when T is much closer to A than to B. 


Problem 13.4: S 
(a) Geometrically speaking, what does A - A equal? 


(b) Express AB? in terms of A-A,A-B,andB-B. What trigonometric relationship is this known as? 


Solution for Problem 13.4: 


(a) Since A-A= Ja > the expression A-A equals the square of the distance from A to the origin. 


(b) Just as A- A is the square of the distance from A to the origin, the expression AB - AB equals the square of 
the distance from A to B. Therefore, we have 


> > > 


AB? = AB- AB =(B-A)-(B-A)=B-B-B-A-A-B+A-A. 


Since A- B = B- A, we have 


Letting O be the origin, so that ||A|| = OA, ||B|| = OB, and ZAOB equals the angle between A and B, we 
recognize the equation above as the Law of Cosines: 


AB* = OA? + OB? — 2(0A)(OB) cos ZAOB. 
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Problem 13.5: 


(a) 


(b) 
(c) 


Use the dot product to write an equation in terms of A, B, @ and D that is true if and only if AB 1 CD. 
Use vectors to show that each median of an equilateral triangle is also an altitude of the triangle. 


Use vectors to show that if each median of a triangle is also an altitude of the triangle, then the triangle 
is equilateral. 


Solution for Problem 13.5: 


(a) 


(b) 


We have AB 1 CD if and only if the angle between AB and CD is 90°. An angle between vectors calls for 
the dot product—the angle between AB and CD is 90° if and only if AB-CD =0. Expressing AB and CD in 
terms of A, B, Č, and D, we see that AB 1 ED if and only if 


(- A)-(B-@) = 


Let AABC be an equilateral triangle and let M be the midpoint of BC. To show that each median of the 
triangle is also an altitude, we must show that AM L BC. From part (a), we have AM . BC if and only if 


Since M = (B + C) /2, we have 


That doesn’t look like much fun to expand, but we can pull a trick out of our complex number toolbox to 


help us: 
Choosing your origin wisely can greatly simplify the application of vectors to 
geometry problems. | 


Here, letting A be the origin produces a very convenient form for the expression on the right above, and 
we have 


Concept: 


wm 
A 
2 
O 
lam] 
z 
5 
z 
© 
g 
ge) 
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We factor out $ and expand the right side. The 
we are left with ee 
> > > B G > = 1 2 > 23 
M-(C-B) = E pe Ace B.B). 


B-Band Č- Č are the squares of the magnitudes of B and Č, respectively. Since A is the origin and AABC is 
equilateral, the magnitudes of B and Č are the same. So, B- B = Č . Č and we have 


ay 


M(€-8)=-(C-C-B B) =0 


Since A is the origin, we have M= AM, so M. (c= B) = 0 tells us that AM - BC = 0, which means AM L BC. 
Note that our proof only used the fact that AB = AC, not the fact that all three sides are equal. So, we 
have proved that in any isosceles triangle ABC with AB = AC, the median from A to BC is also an altitude. 
Applying this fact to each pair of equal sides of an equilateral triangle, we see that all of the medians of an 
equilateral triangle are also altitudes of the triangle. 
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(c) 


(a) Show that if A, B, C, and D are points in space such that AB? + CD? = BC? + AD?, then AC and BD are 


(b) 


Oo 
i roblem 13.6: 
| 


We can essentially reverse the steps we took in part (b). If median AM is perpendicular to side BC, then we 


must have Pe. ae 
(M—-—A)-(C-B)= 
We have M = (B + ©) /2, and we let A be the origin, and we have 


B C 2? 2 
EPKER) 


We multiply by 2 and expand the dot product on the left side. The B-Cand -C B terms cancel and we have 
C-C-B-B=0, 


which gives us C.C=B-B. Therefore, the magnitudes of B and C are the same. This means that B and 
C are equidistant from the origin. But A is the origin! So, we have AB = AC. Similarly, we can show that 
AB = BC by considering the altitude from B, and we conclude that AABC is equilateral. 


orthogonal. 


Look back at Problem 8.17 on page 274. How is part (a) a more general result than the problem we | 
solved in Problem 8.17? What sorts of problems can we consider with vectors that we cannot tackle $ 
with complex numbers? 


Solution for Problem 13.6: 


(a) 


(b) 


Writing AB? = AB - AB = (È — A) : (B - A), and likewise for the other terms in the given equation, we have 
(B— A) -(B - A) + (D-C)-(D-C) = (C- B) -(C- B) + (D- A) -(B-A). 


When we expand the dot products on both sides, A.A,B-B,C-C,andD-D appear as terms on both sides, 
so these terms all cancel and we are left with 


> 


-2B-A-2D-C = -28-C-2D-A. 
Dividing by 2 and rearranging this equation gives 
B-C-B-A+D-A-D-C=0. 
We group the first two terms and the second two terms as 
BG ie (=A) = 0. 
We factor again to get ee 
(B - D): (C-A) = 
sO DB . AC = 0. This means that AC and DB are orthogonal. Since BD = —DB, we see that AC - BD = 
—AC - DB = 0, so AC and BD are orthogonal as well. 


The main difference between our result in part (a) and the result in Problem 8.17 is that our work in 
Problem 8.17 only proves the result in two dimensions. Our work in part (a) shows that it holds in three 
dimensions, as well. Moreover, it shows that if we define distance in four or more dimensions analogously 
to our definition of distance in two and three dimensions, then the result holds for higher dimensions, as 
well. This is an example of one significant advantage vectors have over complex numbers: vectors can be 
applied to problems in more than two dimensions. O 
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[Exercises J 


(a) Use vectors to prove that if the diagonals of quadrilateral ABCD bisect each other, then ABCD is a parallel- 
ogram. 


(b) Use vectors to prove that the midpoints of the sides of any quadrilateral are the vertices of a parallelogram. 


13.1.2 We call quadrilateral ABCD is a kite if AB = BC and CD = DA (or if BC = CD and DA = AB). Use vectors 
to show that the diagonals of a kite are perpendicular. 


13.1.3 Show that if M and N are midpoints of AC and BD, respectively, then AB + AD + CB + CD = 4MN. 


13.14 A median of a quadrilateral connects the midpoint of one side of the quadrilateral to the midpoint of the 
opposite side of the quadrilateral. Show that the diagonals of a quadrilateral are perpendicular if and only if the 
medians of the quadrilateral are equal in length. 


13.1.5 Show that in any AABC, we have CA? + CB? = 2CM? + 4, where M is the midpoint of AB. 


13.1.6 Let A, B, C, and D be points in space. Show that if AB and CD are orthogonal and AC and BD are 
orthogonal, then AD and BC are orthogonal. 


13.2 Vectors in the Triangle 
a ~ 
Problem 13.7: Show that all three medians of AABC pass through a point G such that G =(A+B+C)/3. 


Problem 13.8: Show that if we let the circumcenter of triangle ABC be the origin, then all three altitudes of 


AABC pass through the point H such that Ë= A+ B+C. 


Problem 13.9: The centroid of a triangle is the point at which the triangle’s medians are concurrent and the 
orthocenter of a triangle is the point at which the triangle’s altitudes are concurrent. Let G, H, and O be the 
centroid, orthocenter, and the circumcenter, respectively, of AABC. Show that if O and H are distinct points, 
then G is on OH such that OG/OH = 1/3. 


Problem 13.10: Let D, E, F be the midpoints of BC, AC, and AB, respectively, and let J, K, L be the feet of 
the altitudes of AABC from A, B, C, respectively. Finally, let H be the orthocenter of AABC and let U, V, W 
be the midpoints of AH, BH, and CH, respectively. In this problem, we prove that a circle passes through 
D,E,F, ],K,L,U, V, and W. We call this circle the nine-point circle of AABC. 
(a) Let the circumcenter of AABC be the origin. Express D, E, and F in terms of A, B, and C. Suppose T is the 
center of the circle that passes through D, E, and F. Use the forms of D, È, and F to guess T in terms of A, 
B, C. Confirm that your guess is correct by showing that TD = TE = TF. 


(b) Find Ü, Y, and W in terms of A, B, and Č. Show that TU = TV = TW, and that U, V, and W are on the 
circle from part (a). 

(c) Show that UD, VE, and WF are diameters of the nine-point circle. Why does this tell us that J, K, and L 

are on the nine-point circle? 
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Problem 13.11: Suppose that A’, B’, and C’ are on the circumcircle of AABC such that AA’, BB’, and CC’ are 
diameters of the circumcircle. If K, L, and M are the midpoints of BC, AC, and AB, respectively, then show that 


WK, BL, and EM M are concurrent. 


Problem 13.12: Each segment from a vertex of a tetrahedron to the centroid of the opposite face of the tetrahedron 
is called a median of the tetrahedron. Prove that the medians of any tetrahedron are concurrent at a point called 
the centroid of the tetrahedron. Note that the centroid of a triangle cuts each median in a 2 : 1 ratio. Can we 
make a similar statement about the centroid of a tetrahedron? 


Problem 13.7: Show that all three medians of AABC pass through a point G such that G=(A+B+ C) ee 


Solution for Problem 13.7: Let M be the midpoint of BC, so M = (B + C)/2 and G 
M=Ñ-8-" -À M 
We have A B 
=) 2 = A+B+C a BC 2A 2 B+ a 2 2 
ee A -A= a a : -A) 34M. 


Since AG is a scalar multiple of AM, points A, G, and M are collinear. Moreover, since AG = 2AM, point G is on 
AM such that AG = 2AM. 


There’s nothing special in our proof above about vertex A; the same exact steps show that G is on the medians 
from B and from C, and that the distance from G to each vertex is Ẹ of the length of the median from that vertex. O 


Important: ~The medians of any triangle are concurrent at a point called the centroid. If the 
centroid of AABC is G, then G = (A+ B + C)/3. 


Problem 13.8: Show that if we let the circumcenter of triangle ABC be the origin, then all three altitudes of 
| AABC pass through the point H such that H=A+B+C. 


Solution for Problem 13.8: Because H is on the altitude from A, we must have AH 1 BC. C 
Vectors give us a nice condition to test if two lines are perpendicular: the dot product. 


We have AH = H-A=(A+B+CQ)-A=B+C,s0 
Pee B)=8-C-b-B+C.C-C.-B. 
Since B - Č = Č - B, two terms cancel, leaving A B 
AH-BC = C-C-B-B = |C? - IB. 


We'd like to show that this expression equals 0, but now we seem stuck. 


Concept: When stuck on a problem, look back at the problem for any information you 


O= haven't used. 
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We haven't used the fact that the circumcenter of AABC is the origin. The circumcenter of a triangle is 
equidistant from its vertices, so || All| = |B || = Cll. Therefore, we have AH - BC = lel = IIB]? = 0, which means 
that AH 1 BC. Similarly, we can show that BH - AC = CH - AB = 0, so H is on all three altitudes of AABC. o 


Important: The altitudes of any triangle are concurrent at a point called the orthocenter. 
WW Let H be the orthocenter of AABC. If the circumcenter of AABC is the origin, 
then = A+B+C. 


a ae 


: : e 
WARN ING!! Let H be the orthocenter of AABC. Our proof above only shows that we have | 
à H = A + B + C if the circumcenter of AABC is the origin. 
We've already seen the power of choosing the origin wisely several times in problems involving the application 


of complex numbers or vectors to geometry problems. Problem 13.8 shows us a frequently useful choice for 
problems involving triangles. 


When applying vectors to a problem involving a triangle, choosing the circum- 
center as the origin can greatly simplify the problem. 


Concept: 


As we saw in Problem 13.8, this tactic is useful because the magnitudes of the vectors from the origin to the 
vertices of the triangle are all equal if we let the circumcenter of the triangle be the origin. The simple form we 
found for the orthocenter when we let the circumcenter be the origin gives us another good reason to make this 
choice of origin in problems involving the orthocenter. 


| Problem 13.9: Let G, H, and O be the centroid, orthocenter, and the circumcenter of AABC. Show that if H 
and O are distinct points, then G is on OH such that OG/OH = 1/3. 


Solution for Problem 13.9: We have a convenient vector representation for the centroid, and if we let the circumcenter 
be the origin, we have a very convenient representation for the circumcenter and a nice one for the orthocenter. 


Letting O be the origin, we have OG = Gand OH =H. Using our nice expressions for G and H, we have 


Therefore, G is on OH such that OG/OH = 1/3. O 


The line through the orthocenter, centroid, and circumcenter of a non-equilateral 
triangle is called the Euler line of the triangle. (If a triangle is equilateral, these three 
points are all the same.) In the diagram at right, the altitudes are dotted, the medians are 
dashed, and the solid line passing through the orthocenter, centroid, and circumcenter 


of the triangle is the Euler line. 


Problem 13.10: Let D, E, F be the midpoints of BC, AC, and AB, respectively, and let J, K, L be the feet of the 
altitudes of AABC from A, B, C, respectively. Finally, let H be the orthocenter of AABC and let U, V, W be the 
midpoints of AH, BH, and CH, respectively. Show that a single circle passes through D, E, F, J, K, L, U, V, and 
W. This circle is called the nine-point circle of AABC. 
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Solution for Problem 13.10: We can show that all nine points are on 
the same circle by finding a single point that is the same distance 
from all nine points. We think to attack this problem with vectors 
because we can easily handle midpoints and the orthocenter with 
vectors. But to deal with the orthocenter, we must remember to let 
the circumcenter of AABC be the origin. 


We start with the easiest points to work with, the midpoints of 
the sides, since these are the simplest to express with vectors. We 
have 


5. Be 
B=, 
> Awe 
ES z 
> A+B 
a 2 


The desired circle must pass through all three of these points. We let the circle have center T, and will refer to the 
circle as OT. Can we use the expressions for D, E, and F to deduce T in terms of A, B, and C? 


Concept: Don’t overlook “guess-and-check” as a strategy, even in advanced problems. 


C= Guessing and then confirming that your guess is correct has a long and glorious 
| history in mathematics and science. 


We seek a point T such that DEE, ET, and FT all have the same magnitude. Since DT=- D, ERS T= P, and 
FT one Ë, a natural choice for T that offers a lot of cancellation in these differences is 
> A ap B + Č 
T = ————_. 
2 
This expression gives very simple results for DT, ET, and FT: 
ieee eat Ce 
2 2 2 
P -p -p-4tBtC_A+C bi us 
2 2 2 
r-r- P- A+B+C _ A+B = c 
2 2 2 
Recalling that the origin is the circumcenter of AABC, we see that this choice of T is a winner, since ||A l=) B | = || C | 
gives us ||D7| = ETI = FT|). So, T is the center of the circle that passes through D, E, and F; we'll refer to this 
circle as OT. Moreover, because we know that \|A || = R, the circumradius of AABC, and ||D7| = |a ||/2, we know 


that the radius of the circle through D, E, and F is R/2. We also recognize that 

A+B+e_ A 
2 w 

so T is the midpoint of the segment from the circumcenter to the orthocenter. This means T is on the Euler line, 

too! 
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Next, we turn to U, V, and W, because we know how to handle 
the orthocenter with vectors. Since U, V, and W are the midpoints 
of AH, BH, and CH, respectively, we have 


A+H 2448+ 
pee AB eG 
2 2 


pa Ti CE 


W= = 


We need to show that each of U, V, and W is R/2 from T, so we find the magnitudes of Ur VT, and WT: 


> > 3 AB aE 24+B+Č A A R 
Wae ala 
= > > Ane A+2B+Č B B R 
(225 lll a E ||| 
> —— ee BG aA hea C Cll R 
e ed a ae || = lll 


So, OT passes through U, V, and W in addition to D, E, and F. Six points down, three to go! 


The last three points, the feet of the altitudes of AABC, look a lot more challenging. We haven't seen simple 


vector expressions in terms of A, B, and Č for the feet of the altitudes of AABC. So, let’s try learning a little bit 
more about the six points that we have already proved are on the circle. Looking closely at our diagram, it looks 
like DU, EV, and FW each pass through T. If they do, they’re diameters of OT. 


| Concept: Precise diagrams can help you discover important relationships in geometry prob- | 
| lems. 


Of course, it’s not good enough that DU, EV, and FW look like 
diameters. We have to prove it. Vectors can help us with that, too. 
The midpoint of DU is 


+C 


> ey B. louie = 
D+U _ Beč 4 2AB _ A+ 


2 2 


2 


NI Soy 


Success! T is the midpoint of DU, so DU is indeed a diameter of OT. 
Similarly, = =T and Pu = T so EV and FW are diameters of OT 
as well. 


Now, we're ready to tackle J, K, and L. The circumcircle of a right triangle is the circle with the hypotenuse of 
the triangle as a diameter. Therefore, the circumcircle of right triangle DJU is the circle with hypotenuse DU as 
diameter. But this is OT! Since OT is the circumcircle of ADJU, it passes through J. Similarly, OT is the circumcircle 
of right triangles EKV and FLW, so K and L are on OT, as well. O 


“C Extra! The midpoint of any segment from the orthocenter of a triangle to a point on the circumcircle of 
miis the triangle is on the nine-point circle of the triangle. 
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Sidenote: The nine-point circle has interesting relation- 

N ships to other parts of a triangle. For example, 

the incircle of a triangle is tangent to the nine- 

point circle, as shown at right (except when the 

triangle is equilateral, in which case the two cir- 

cles are the same). The nine-point circle is also 

tangent to the excircles of AABC, which are the C B 

three circles outside AABC that are tangent to one side of the triangle and tangent 

to the extensions of the other two sides. That the nine-point circle of a triangle is 

tangent to the incircle and the excircles of the triangle is sometimes referred to as 
Feuerbach’s Theorem. 

One direct result of Feuerbach’s Theorem is that the inradius of a triangle is no 

more than half the circumradius of the triangle. See if you can figure out why! 


| Problem 13.11: Suppose that A’, B’, and C’ are on the circumcircle of AABC such that AA’, BB’, and CC’ are 
| diameters of the circumcircle. If K, L, and M are the midpoints of BC, AC, and AB, respectively, then show that 


| BL, and E'M are concurrent. 


Solution for Problem 13.11: We use vectors because we know how to express all the key 


points with vectors in terms of A B, and C. First, we have B’ A’ 
5) B + C > C + A > A ar B 
K = = = . 
2. 4 2 i 2 
Then, we let the circumcenter be the origin. Since the circumcenter is the midpoint of 
AVA, B'B, and C’C, we have A’ = —A, B’ = -B, and Č = -C. A 


We seek a point P such that P is on AK, BL, and C’M. From the symmetry of the 
problem, we expect that we can express P as some symmetric expression in terms of A, B, and Č. Of course, the 


simplest outcome would be if P were the origin, but our diagram makes it clear that the lines WR, BT, and EM 
don’t meet at the center of the circle. 


So, we try the next simplest reasonable guess, the orthocenter. In fact, this might have been our first guess if 
we had taken a close look at the diagram! If P = A + B + Č, we have 


Ae _2Å+B+Č 
> 


Success! We have A’P = 2KP, so the orthocenter is on AK for any choice of A, B, and C. Similarly, we can show 
that the orthocenter is on BL and EM, so the three lines are concurrent at the orthocenter. 


Once we made the observation that we expect P to have a symmetric expression in terms of A, B, and C, we 
> 


also could have found P without further guessing. We start by noting that A’K = Hee - (-A) =A+ Bae . SO, we 
seek the vector P = t(A + B + C) such that A’P is a scalar multiple of A + 24°. We have 

A’P = (A+ B+C)-(-A) = (t+ 1)A4 1B +Ë. 
This is a scalar multiple of Ae BxC if and only if t +1 = 2t, which gives us t = 1. By essentially the same argument, 
the point P such that P = A+B + Cis on the other two lines as well. 0 
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l Let's see if we can take some of our tools for triangles up one dimension, and apply them to the three- 
dimensional analogue of a triangle: a tetrahedron. 


Problem 13.12: Each segment from a vertex of a tetrahedron to the centroid of the opposite face of the 
tetrahedron is called a median of the tetrahedron. Prove that the medians of any tetrahedron are concurrent 
at a point called the centroid of the tetrahedron. The centroid of a triangle cuts each median in a 2 : 1 ratio. 
Can we make a similar statement about the centroid of a tetrahedron? 


Solution for Problem 13.12: The vector to the midpoint of a segment is the average of the vectors to the endpoints 
of the segment. The vector to the centroid of a triangle is the average of the vectors to vertices of the triangle. A 
reasonable guess is that if there is a point G on all four medians of some tetrahedron ABCD, then 


ae Rese 8 
4 
Let V be the centroid of face BCD, so 
= B+C+D 
w= 
3 
To show that G is on AV, we find AG and AV: 
A Bey oot et CHD eee 
4 4 
Ai CY- HIE go 


So, we have AG = SAV, which means that G is the point on AV such that AG/AV = 3/4. We can similarly show 
that this choice of G is on each of the other medians, and divides each median in this same ratio. Therefore, just 
as the medians of a triangle are concurrent at a point that divides each median in a 2 : 1 ratio, the medians of a 
tetrahedron are concurrent at a point that divides each median in a3: 1 ratio. O 


I] __txercises Ba 

13.2.1 Find the centroid of the triangle with vertices (1, 4, —1), (—4, 5, 2), and (6, -1,7). 

13.2.2 Leta = BC, b = AC, and c = AB in AABC. 
(a) Let D be on BC such that AD bisects ZBAC. Find Din terms of a, b, c, A, B, and Č. Hints: 104 
(b) Let I be the incenter of AABC. Find Tin terms of iy eee A B, and C. 


13.2.3 Show that for any point P and any triangle ABC, the sum of the squares of the sides of the triangle is no 
greater than 3 times the sum of the squares of the distances from P to each of the vertices of AABC. At what point 


do we have equality? 
13.2.4 Show that if H and O are the orthocenter and circumcenter, respectively, of AABC, then H =A+B+C-20. 


13.2.5 The centroids of the faces of a tetrahedron are (17,0,3), (—2, 13,3), (1,2,15), and (1,-1,2). What are its 
vertices? 


13.2.64 Points U, V, and W are the reflections of the circumcenter of AABC over BC, AC, and AB. Show that Au, 
,and CW are concurrent. At what special point of AABC are they concurrent? 
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13.3% Vectors, Complex Numbers, and Challenging Problems 


Our work so far in this chapter probably reminds you of some of the work we did in Chapter 8 with complex 
numbers. For example, if M is the midpoint of AB, then M= (A + B)/2 in vectors and m = (a + b)/2 in complex 
numbers. Also, if OACB is a parallelogram and O is the origin, then A+B = Č in vectors and a + b = c in complex 


numbers. If G is the centroid of AABC, we have G = (A+ B+ C)/3 and g = (a+b +c)/3. And so on. What’s going 
on here? 


Let’s look back at what a vector in two dimensions is. Vectors in two dimensions are ordered pairs of numbers 
that have the following operations: 


e Vector addition. We add vectors by adding the corresponding components: 
(1) + (a) = Gea) 
b d) \b+d/° 


e Multiplication of a vector by a constant. We multiply a vector by a constant by multiplying each component 
of the vector by the constant: 
‘Q-@ 
b) — \kb/ ` 


A complex number is an ordered pair of real numbers, the real part and the imaginary part of the complex 
number. Complex numbers have these same two operations: 


e Complex number addition. We have (a + bi) + (c + di) = (a + c) + (b + d)i. 


e Multiplication of a complex number by a real constant. If k is real, we have k(a + bi) = ka + (kb)i. 


We see that complex numbers are vectors, so the similarities between the applications of vectors to geometry and 
the applications of complex numbers to geometry are not a coincidence! 


We also defined multiplication of complex numbers as 
(a + bi)(c + di) = (ac — bd) + (ad + be)i. 


Note that the dot product is not multiplication of vectors in the same sense, since the dot product of two vectors 
is not a vector, whereas the product of two complex numbers is itself a complex number. We saw in Chapter 8 
that complex number multiplication has some very nice applications to geometry, particularly when dealing with 
rotations. 


As we’ve seen, we usually use complex numbers and vectors for different types of algebraic problems. For 
example, we wouldn’t use vector notation to express the roots of x? +x +1, we don’t use the concept of “conjugate” 
with general vectors in two dimensions, and we don’t generally use anything quite like matrices when dealing 
with complex numbers. Some tools overlap, such as magnitude, which is essentially the same in both. 


With most problems that involve complex numbers or more general vectors, it’s clear which tool we should 
use. In this section, we look at a series of very challenging problems in an area where the applications of complex 
numbers and vectors overlap a great deal: geometry. Some of these problems are best tackled with the machinery 
we’ ve developed for complex numbers, others with the tools we’ve used with vectors. Still other problems can 
be tackled with either, and others with concepts drawn from both. 
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a - 


Problem 13.13: Consider the points A(0, 12), B(10, 9), C(8, 0), and D(—4,7). There is a unique square S such that 
each of the four points is on a different side of S. Find the area of S. (Source: AIME) Hints: 106 


Problem 13.14: Points P and Q are on sides AB and AC of equilateral triangle ABC such that PQ || BC. Show 
that if M is the midpoint of PC and T is the centroid of AAPQ, then ATMB is a 30-60-90 triangle. 


Problem 13.15: Way back on page 185, we helped Petr figure how long his trip to Honolulu would be by 
finding the distance along the surface of the earth from his home city of Petropavlovsk-Kamchatsky, which has 
coordinates 52.92° N, 158.49° E, to Honolulu, which has coordinates 21.31° N, 157.83° W. Before he left on his 
trip, he tried to look up the distance between the two cities on the internet. He found a formula that said the 
distance between points P and Q on the Earth is œR, where R is the radius of the Earth and a is the angle such 
that 0 <a < n and 


cos @ = cos Ap cos Ag cos(AB) + sin Àp sin Àg. 


Unfortunately, he couldn't find any information about what the symbols Ap, Ag, and AB mean. Help Petr by 
figuring out what this formula is all about. Hints: 4 


Problem 13.16: Let O be the center of the circumcircle T of a regular polygon A1A2A3--: A25426 with 26 sides. 
Let O; and O; be the reflections of O over A254; and A246, respectively. Prove that O;O2 is equal in length toa 
side of an equilateral triangle inscribed in T. Hints: 191 


Problem 13.17: Let ABCDEF be a convex hexagon with AB = BC, CD = DE, and EF = FA. Show that the lines 
containing the altitudes of triangles BCD, DEF, and FAB from C, E, and A, respectively, are concurrent. (Source: 
Poland) Hints: 275 


Problem 13.18: Use complex numbers to prove that the area of a triangle with side lengths a, b, and c is 


y s(s — a)(s ~ b)(s - c), 


where s = (a + b + c)/2 is the semiperimeter of the triangle. Hints: 285 


| Problem 13.13: Consider the points A(O, 12) B(10, 9), C(8,0), and D(—4,7). There is a unique square S such 
| that each of the four points is on a different side of S. Find the area of S. (Source: AIME) 


Solution for Problem 13.13: First, we note that because AC and BD intersect, each y 
segment connects points on opposite sides of the square. Next, we wonder 
what additional piece of information we need in order find the side length of 
the square. 


If we knew a direction of the side of the square through A, that would be p. 
enough for us to build the whole square. The opposite side has the same direc- 
tion, so we could use this direction to draw lines through A and C containing 
the sides of the square. Then, we draw lines through B and D perpendicular to 
the line through A, and the four intersection points of our four lines give us the 
vertices of the square. 


C 


All this talk about “direction” has us thinking about vectors. Let’s see how we might apply vectors to the 
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problem. We let u = ¢ ) be a unit vector in a direction of the side of the square through A. Any vector orthogonal 
to U ) is perpendicular to the side through A, and is therefore in the direction of the side through B. This means 


that the vector v = a is in the direction of the side through B. That uses the fact that consecutive sides of a 


square are perpendicular. Now, we look for a way to use the fact that the sides of a square are the same length. 
We can find a vector whose length equals the side length of the square by projecting BD onto l ) , the direction 


of the side through A. Similarly, the magnitude of the projection of AC onto e must also be the length of a 


side of the square. 


Since AC = G z. = E and BD = T B : o = (es ) , and u and v are unit vectors, the projections of 


AC onto v and of BD onto u are 


oo, _ AC-v _ -8q-12 
proj AC = rv = eo = (8 - 12p)v, 
a JD Ee 
proj BD = 2 = = EEA N = (—14p — 2q)u. 
Iul 1 


As discussed above, the magnitudes of these projections are equal. We have |lul| = ||vi| = 1, so the magnitudes of 
these projections are |-8q — 12p| and |—14p — 2q]. If these are equal, then we have either —8q — 12p = —14p — 2q or 
—(—8q — 12p) = —14p — 2q. The first gives p = 3q and the second gives 13p = —5q. So, we have 


Om À 


(Remember, E ) is a unit vector.) A quick sketch of both cases tells us which we want, and exhibits the interesting 


() = eva) 


result that if there’s a second square such that each of the given points is on a different extended side of the square. 


O- 


We have p = 3/V10 and q = 1/V10 in the case in which A, B, C, and D are on the sides of the square themselves 
(rather than the extensions of the side). The side length of the square equals the magnitude of the projection found 


above, or |—8q — 12p| = 44/ ¥10. Therefore, the area of the square is (44/ VIO} = 193.6. 


O= ea) 


(Note that we can also take (e ) to be the negatives of the direction vectors found above, and doing so leads 


to the same two squares.) 0 
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a 


The triangle relationships we developed in the previous section using x vectors also 
hold in complex numbers. Let a, b, and c be complex numbers that correspond to | 
the respective vertices of AABC in the complex plane. Then, the centroid of AABC 
is (a + b + c)/3. If the origin is the circumcenter of AABC, then the orthocenter of 
AABC is a+ b la c and the center of the nine-point circle of AABC i is s (a +b + c)/2. 


Sidenote: 


Problem 13.14: Points P ser id Q are on mere AB sate AC i equilateral tengi ABC such that PQ || BC. Show 


that if M is the ea of PC and T is the centroid of a then ATMB i isa 30- 60- 90 BEES 


Solution for Problem 13.14: We have complex number tools for dealing with equilateral B 
triangles, so we'll start with complex numbers. As usual, we denote the complex 
number corresponding to each point with the lowercase version of the uppercase letter 
used to label the point. We let A be the origin, so equilateral triangle ABC gives us 


b = Cc, where C is a primitive sixth root of unity. Since PQ || BC, we have ZAPQ = ZABC P 

and ZAQP = ZACB, which means AAPQ is equilateral, too. Therefore, the same rotation M 

that maps C to B also maps Q to P, which means that p = Cq. Moreover, Q is a dilation T 

of C about the origin, so there’s some real constant k such that q = kc. Therefore, we A Q C 


have p = Ckc, and we now have b, p, and q in terms of c. 


The only points we haven’t tackled yet are M and T. Since M is the midpoint of PC, we have 


_p+c (Cc le 
i ee 


Since T is the centroid of AAPQ, we have 


oo ORE ket Che (1+ 0kc 

3 3 gm 

Looking at our diagram, it appears that if ATMB is a 30-60-90 triangle, then TMB = 90° and ¿MTB = 60°. We 
could use our usual complex number tools to show separately that |b — t| = 2|m — t| and BM L TM, and thereby 
deduce that ATMB is a 30-60-90 triangle. Instead, we note that we can prove that ATMB is a 30-60-90 triangle in 
one step if we show that b — t = 2C(m — t), since this equation would tell us that BT = = 2MT and that a 60° rotation 
about T (with the same orientation as the rotation we used to map C to B) maps TM to TB. From this, we can 
deduce that ATMB is a 30-60-90 triangle with ZBTM = 60° and ZBMT = 90°. 


Now that we have a plan, it’s time to execute it: 


2C(m — t) = 20 = - Oe = 2ct Se = F(k+3-2) 
and 


A Lor _ Sho ke = Uke _ 


3 


To see that these are equal, we'll have to use the fact that Ç? = C—1 for either primitive sixth root of unity: 


c 
Seay): 


2U(m — t) = © (Ck + 3-24) = £ (Ck + 30 — 2k) = 5 (C= 1)k + 37 = 20) = (BE -k - K0). 


This equals our expression for b — t above, so b — t = 2C(m — t), which means that ¿BTM = 60° and BT = 2MT. 
Therefore, triangle TMB is a 30-60-90 triangle. O 
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Vectors give us another way to think about some applications of complex numbers B 
to geometry problems. For some people, it is much more natural to think of rotating 
TM by 60° about T to get a vector in the direction of BT than it is to think of rotating 
m by 60° about t to get a complex number that is on the ray from t through b. One way 
to tie these two ideas together is to associate each complex number with a vector from 
the origin to the corresponding point i in the complex plane. So, TM is the vector from 
t to m in the complex plane. Since TM = M-T, we associate TM with the complex 4 C 
number m — t. Q 


With this in mind, you might realize that you want to show that b — t = 2¢(m — t) by thinking, “Triangle TMB is 
a 30-60-90 triangle if ZMTB = 60° and BT = 2MT. So, if rotating 1m — t by 60° and then scaling the result by a factor 
of 2 produces b — t, then ATMB is a 30-60-90 triangle. Therefore, we wish to show that b — t = 2C(m — t), where the 
factor of Ç provides the rotation and the 2 provides the scaling.” 


Problem 13.15: Way back on page 185, we helped Petr figure how long his trip to Honolulu would be by 

| finding the distance along the surface of the earth from his home city of Petropavlovsk-Kamchatsky, which $ 
has coordinates 52.92° N, 158.49° E, to Honolulu, which has coordinates 21.31° N, 157.83° W. Before he left on 
his trip, he tried to look up the distance between the two cities on the internet. He found a formula that said 
the distance between points P and Q on the Earth is aR, where R is the radius of the Earth and a is the angle 
such that 0 < a < n and 


cos a = cos Ap cos Ag cos(Af) + sin Ap sin Ag. 


Unfortunately, he couldn’t find any information about what the symbols Ap, Ag, and AB mean. Help Petr by 
| figuring out what this formula is all about. 


Solution for Problem 13.15: The key to our solution to the earlier problem was finding the angle formed by 
connecting each city to the center of the Earth. Vectors give us a nice tool for finding angles: the dot product. In 
order to use the dot product, we'll have to use the rectangular coordinates of the cities. However, the latitude 
and longitude are in terms of coordinates that are essentially spherical. So, we'll start with spherical coordinates 
for the cities and convert them into rectangular coordinates. Let the spherical coordinates of cities P and Q be 
(R, Op, dp) and (R, Og, po), where R is the radius of the Earth. Converting these to rectangular coordinates, city 
P is at (R cos Op sin dp, R sin Op sin dp, R cos gp) and city Q is at (R cos Og sin dg, R sin Og sin ġo, R cos ġo). Letting 
the center of the Earth be the origin, we have 


G = ||P IGI cosa = R? cosa, 
where a is the angle between P and Q. Computing the dot product gives 
P- Q = (R cos Op sin Pp)(R cos Og sin bg) + (R sin Op sin ġp)(R sin Og sin ġo) + (R cos hp)(R cos po) 

= R?(cos Op sin op cos Og sin hg + sin Op sin p sin Og sin bg + cos hp cos po) 

= R*((sin pp sin dg)(cos Op cos Og + sin Op sin 8g) + cos Pp cos QQ) 

= R*((sin dp sin ġo) cos(Op — 6g) + cos bp cos hg). 
Substituting this into the equation above for P- Q, we have 

R? ((sin dp sin po) cos(Op — Og) + cos bp cos bg) = R? cosa. 


Dividing by R? gives 
cos g = sin dp sin dg cos(Op — Og) + cos dp cos da. (13.1) 
This looks very close to the formula Petr found, which is 


cos æ = cos Àp cos Ag cos(Af) + sin Ap sin Ag. 
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Right away, we suspect that AB = Op — Gg. Since the @ coordinates in spherical coordinates are essentially the 
same as longitude, we can take the Af to be the difference in longitude between P and Q. 


Turning to the relationship between the @ angles and the A angles, we see that for these angles, Petr’s formula 
has cosines where ours has sines, and vice versa. We unravel this mystery by recalling the relationship between 
latitude and the -coordinate of spherical coordinates. If P is in the Southern hemisphere and O is the center of 
the Earth, then the angle OP makes with the upper half of the Earth’s axis (p in spherical coordinates) is 90° 
plus the latitude of P. If P is in the Northern hemisphere, then the angle OP makes with the upper half of the 
Earth's axis is 90° minus the latitude of P. Taking Ap to be the latitude of P, where we let Ap > 0 for points in the 
Northern hemisphere and Ap < 0 for points in the Southern hemisphere, we have dp = 90° — Ap. Therefore, we 
have cos @p = sin Ap and sin @p = cos Ap, so Equation (13.1) becomes 


cos a = cos Ap cos Ag cos(Af) + sin Ap sin Ag, 


and the mystery of Petr’s formula is solved. 0 


| Problem 13.16: Let O be the center of the circumcircle T of a regular polygon A;A2A3- 


1 
J 


se Ar5Ar6 with 26 sides. 
Let O; and O; be the reflections of O over A254; and A246, respectively. Prove that 0102 is equal in length to 
|a side of an equilateral triangle inscribed in I. 


Solution for Problem 13.16: Our main clue to start with complex numbers is the regular polygon. We place the 
problem in the complex plane by letting O be the origin and letting A; correspond to e*/79 for 1 < k < 26. So, if we 
let w = e™/!3, then A; corresponds to w*. Since O; is the reflection of the origin over A254, the midpoint of A2541 
is the midpoint of OO}. Therefore, O; is the dilation of this midpoint about the the origin with scale factor 2. The 
midpoint of A254; is (w* + w)/2, so O; corresponds to twice this number, or w” + w. Similarly, O2 corresponds to 


OT w 


We wish to show that O10 equals the length of a side of an equilateral triangle inscribed in the circle. The 
radius of the circle is 1, so the side length of an equilateral triangle inscribed in the circle is v3. We have 
0102 = |w® + w? — (w” + w)|, So We now must show that 


lw® +o? —w — w| = V3. 


We'll look at O; OF since we can use the fact that |z|? = zZ to work with our scary expression for 0102. We have 


OLO = (aÉ + w? — w” - w(i + a? — © — w) = (W +a? - o” — w (0° + - ©” -@). 


That looks plenty terrifying to expand, but before we start doing so, we make a few simplifying observations. 


2 Zol e 35 
First, because |w*| = 1, we have w*a* = 1, which means @* = zz. Since w = 1, we have 
26 
ee du eT oak 
wk ak 


(See if you can find a geometric explanation for this relationship.) Applying this to the terms in the second factor 
of our expression for 0102 gets rid of the conjugates and leaves 


0,02 = (aÉ + a —w” — w)(w + w” — w - w”). 


Next, we note that every vertex of the polygon is diametrically opposite another vertex of the polygon, and is 13 
vertices away from its opposite. So, rotating one vertex by 7 around the origin gives a vertex that is 13 vertices 
away from the original vertex. In complex numbers, we write this as w* = —wk-}8, Another way to see this is to 
note that w! = (e#/13)13 = et! = -1, so w* = œt! - w? = -w*". Applying this relationship to every term in the 
expression with exponent 13 or greater, we have 


11 


0,03 = (w® + Ww +a! —w)(-—w’ -w -w + a) 
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Now, we're ready to expand, keeping in mind the simplification w* = ~w*-3 whenever we see a power of w 
greater than 12. We find 


11 


O10} = (wf + @* + w? -— w)(-w? -w -w+ w”) 


= w(—w — w! — w + w) + w (w -wo -w + w!) 
E a =o =a) 


13 


a) =(—as =w 


= (ea =a Say a (= —w? —w + w) 


e e Rei E O e O) 

= (14+ w* -w —w°) + (-w’ +1- -w) 

ae. hla. )+(a° +o +o +) 

Ati =o fe -w +e a +a —o +o =o +o 

That’s quite a compelling expression! We can simplify it either by recognizing it as geometric series with common 
arag PEREP ' B y ganna 8 
ratio —w, or by recalling the factorization 
el (oriG =o +o -o ta -o +0 —0 +o —@ o_o) 
Since we have w” = —1 but w # 1, the factorization above gives us 
Ce a a ny a a ee ae ee ee 

so w — w+ wl -w +w -w twi- w + w* — 3 +? —w = —1. Substituting this into our expression for 0103 
gives 0,03 = 4-1 = 3, so O10; = V3, as desired. O 


Concept: Sometimes persistent algebraic manipulation can tackle hard geometry problems 


O= when your geometric instincts fail you. 


That said, don’t always reach for complex numbers or vectors right away when you see a geometry problem. 


Unless you very quickly see that vectors or complex numbers can be used to 
solve a particular geometry problem, you should usually spend some time 
trying to apply geometric tools to the problem before reaching for algebraic 
tools. 


WARNING! 
f as 


Problem 13.17: Let ABCDEF be a convex hexagon with AB = BC, CD = DE, and EF = FA. Show that the 
lines containing the altitudes of triangles BCD, DEF, and FAB from C, E, and A, respectively, are concurrent. 


(Source: Poland) 


Solution for Problem 13.17: The condition for perpendicularity in vectors is a little easier B 

to work with than the condition we have for complex numbers, so we'll try applying 

vectors to the problem. We'll let X be the intersection of the altitudes from C and E,and A C 
our goal is to prove that X is on the altitude from A, as well. Since X is on the altitudes 

from C and E to BD and DF, respectively, we have CX - BD = 0 and EX - DF = 0. We'd 


like to prove that AX - FB = 0. Since there are many points in common in each of these 


statements, we write each vector as a vector difference, hoping to be able to find some D 
convenient ways to combine the equations: E 

(X— C)-(D-B)=0, 

(XS) = 
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It’s tempting at this point to choose X to be the origin, but the conditions AB = BC, CD = DE, and EF = FA 
give us pause. It’s not clear how we would use these conditions if X were the origin. (That said, it is possible to 
finish the problem with X as the origin! See if you can find such a solution on your own.) Rather than choose our 


origin right away, we'll continue with the problem, hoping that we'll find a choice of origin that allows us to use 
the given equal lengths effectively. 


Our hopes of combining the two dot products above to show that (X m) . (B =) = 0 appear to be dashed—there 
are no instances of A in the first two equations, and it’s not clear how to introduce A and get rid of E and C. We do 
at least see that we'll get some cancellation if we expand and add the left sides, since the X- Din the first equation 
will cancel with the —X - D in the second. But the rest will be a mess. 


The symmetry of the problem suggests that we go ahead and include the desired dot product: 
(X- C)-(B-B) =0, 
(X- E)- FD) =0, 
(X—A)-(B-P) =k. 


Our goal is to show that k = 0. Again, note that we’ve set up these equations to have a convenient symmetry. This 
symmetry would have been obscured if we had written the equations in the equivalent forms: 


(C=) (0 .—5).=0, 
(X- £)-(D-F) =0, 
(- X). (F -B)=k. 


When you have options for how to set up parts of a problem, do so in a way that 
allows you to exploit symmetry where possible. 


Concept: 


Returning to the equations 


we see that if we expand the dot products on the left and add all the results, all of the terms with X will cancel, 
and we'll be left with eee’ E Saree 

-C . (D - B) — E. (F - D) -A -(B-F) =k. 
We'd like to show that the left side is 0, but in this form, it’s not clear how to. Expanding everything on the left 
side won't offer any cancellation. However, we could group the terms on the left side by B, D, and F instead of by 
A C and Ë. 


Concept: A different way of looking at the same information can offer important insights. 


O= 


Grouping the terms by B, D, and F gives 


> 2 


k=-C-(0-—B)-B-(F-D)-A-(B-F) = 8 -(C-A)+D-(E-C)+F-(A-8). (13.2) 


OK, maybe that’s not so helpful, either. We seem stuck. 
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Concept: When stuck ona problem, focus information in the problem that you haven’t used 


O= yet. 


We haven't used the side equalities. What can we deduce from AB = BC? We can deduce that B is on the 
perpendicular bisector of AC. Similarly, D is on the perpendicular bisector of CE and F is on the perpendicular 
bisector of AE. Aha! There’s a point on all three of these perpendicular bisectors—the circumcenter of AACE! 
Looking back at our expressions above, we see that this observation solves the problem. If we choose the 


circumcenter of AACE to be the origin, then OB 1 AC, OD L CE, and OF 1 EA, so we have 


> 2 2 > > 


Bode Gn — EA 0) 
Applying these to Equation (13.2), we have 
LEC Aye D-(E —C)+ F:(A—-E£) =8- AC D CEE- EAD. 


Since AX - FB = k = 0, point X is also on the altitude from A to FB. Therefore, the three altitudes described in the 
problem are concurrent. 0 


Problem 13.18: Prove that the area of a triangle with side lengths a, b, and c is 


4/s(s — a)(s — b)(s - c), 


where s = (a + b + c)/2 is the semiperimeter of the triangle. 


Sidenote: Art of Problem Solving Community member Miles Dillon Edwards produced the 
A following dazzling proof of Heron’s formula while still in high school. His proof 
was published in the American Mathematical Monthly in 2008. 


Solution for Problem 13.18: We need to prove a formula for the area of 
a triangle that involves the semiperimeter. We already have one such 
formula: the area of a triangle equals its semiperimeter times the radius 
of its incircle (which we call the inradius). So, we start with the diagram 
at right, in which the angle bisectors of AABC meet at the incenter, I, as 
shown. We connect the incenter to the three vertices and to the points at 
which the incircle is tangent to the sides of the triangle, forming three pairs 
of congruent right triangles, such as AAFI and AAEI. We label the pairs of 
equal angles at I as shown, as well as the equal tangents from the vertices. 


We let the radius of the circle be r, and we label the distances from I to B y D Z G 
the vertices as shown. Adding all the angles at I gives us a + f + y = 7, since all six angles at I must sum to 27. 
That’s a pretty convenient angle. Multiplying complex numbers leads to adding angles. Maybe we can tackle this 
problem with complex numbers. 


We'll need complex numbers with arguments a, f, and y. In right triangle IEA, we have 7? + x* = u? and 
tana = 7, so the complex number r + xi has magnitude u and argument a. In other words, we have r + xi = ue". 
Similarly, we have r + yi = ve? from ABFI and r + zi = we from ACDI. 


Our purpose in finding these complex numbers was to multiply them, and thereby add their arguments: 


(r + xi)(r + yi)(r + zi) = (ue!)(vel®)(we'”) = uvwell@*F*Y), 
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Since a+ ß +y = 7, we have 
(r + xi)(r + yi)(r + zi) = uvwe™ = —uvw, 


which means that (r + xi)(r + yi)(r + zi) is real. Therefore, the imaginary part of this expression is 0. Expanding this 
product yields gives 


Im ((r + xi)(r + yi)(r + zi)) = P(x + y + z) — xyz. 
Since this equals 0, we have 
XYZ 


f= =, 
oop ae 


Since the perimeter of AABC is 2(x + y+z),wehavex+y+z=s. Letting a = BC, b = AC, and c = AB, we have 
a=y+z,b=x+z,andc = x + y. Therefore, we have x = s — y -z = s — (y + z) = s ~ a. Similarly, we find y = s — b 


and z = s — c. So, we have 
ee S 
: 3 


We finish by noting that the area of AABC is the sum of the areas of AIAB, AIAC, and AIBC, which is 


ra rb rc_ fa+b+c\_ = /s(s —a)(s — b\(s —c) 
eo 4S er 5 ) =rs= s(s — a)(s — b)(s — c). 


This formula is known as Heron’s Formula. O 


13.4 Summary 


‘Important: If M is the midpoint of AB, then M = (A + B)/2. 


| 
| 
j 
| 
L 


Important: The medians of any triangle are concurrent at a point called the centroid. If the 
centroid of AABC is G, then G = (A+B +0)/3. | 


Important: The altitudes of any triangle are concurrent at a point called the orthocenter. 
Vv Let H be the orthocenter of AABC. If the circumcenter of AABC is the origin, 
| 


Things To Watch Out For! 
WARNING!! e We have AB = B- A, not AB = A - B. 
ry f 
e Let H be the orthocenter of AABC. We proved that if the circumcenter 
of AABC is the origin, then H=A+B+C. Ifthe circumcenter of AABC 
is not the origin, then we cannot conclude that Bi Mae, 


485 


CHAPTER 13. VECTOR GEOMETRY 


Problem Solving Strategies 


Concepts: e Choosing your origin wisely can greatly simplify the application of vectors 
to geometry problems. 


e When stuck on a problem, look back at the problem for any information you 
haven't used. 


e When applying vectors to a problem involving a triangle, choosing the 
circumcenter as the origin can greatly simplify the problem. 


e Don’t overlook “guess-and-check” as a strategy, even in advanced problems. 
Guessing and then confirming that your guess is correct has a long and 
glorious history in mathematics and science. 


e Precise diagrams can help you discover important relationships in geometry | 
problems. 


e Sometimes persistent algebraic manipulation can tackle hard geometry 
problems when your geometric instincts fail you. 


e When you have options for how to set up parts of a problem, do so in a way | 
that allows you to exploit symmetry where possible. | 


e A different way of looking at the same information can offer important 


| 
| 
| 
| 
| 
| 
| 


13.19 Show that for any points W, X, Y, and Z, we have WX? + YZ? — XY? - ZW? = 2WY - ZX. 


13.20 Use vectors to show that the sum of the squares of the side lengths of a quadrilateral equals the sum of the 
squares of the diagonals of the quadrilateral if and only if the quadrilateral is a parallelogram. 


13.22 Let the diagonals of ABCD meet at X. Show that if 
AB + BC + CD? + DA? = 2(AX* + BX? + CXE pe) 
then either the diagonals of ABCD are perpendicular or one of the diagonals bisects the other. Hints: 15 


13.23 Mr. Assumption was trying to find the orthocenter of the triangle with vertices (4, —3), (2, —1), and (-1,5). 


Here’s what he wrote: “If AABC is a triangle, then the orthocenter H satisfies H = A+B+C. So, we get the 
orthocenter by simply adding the coordinates: (4 + 2—1, -3 - 1 + 5) = (5, —1).” What mistake did Mr. Assumption 
make? 


13.24 Let O be the circumcenter of AABC. Let M be the midpoint of AB, and let X be the centroid of AACM. 
Show that CM . OX if and only if AB = AC. 


13.25 Let ABCD bea cyclic quadrilateral (a quadrilateral that is inscribed in a circle). Show that the orthocenters 
of ABC, BCD, CDA, and DAB are the vertices of a quadrilateral that is congruent to ABCD. 
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13.26 Let O, H, and R be the circumcenter, orthocenter, and circumradius, respectively, of AABC. 
(a) Show that OH? = R*(3 + 2(cos2A + cos 2B + cos2C)). Hints: 224 
(b) Show that in any AABC, we have cos? A + cos? B + cos? C > 2. Hints: 82 


13.27 Let point X be any point inside AABC. Through the midpoints of AB, BC, and CA, we draw lines that are 
parallel to XC, XA, and XB, respectively. Show that these three lines are concurrent. Hints: 64 


13.28 Points U, V, W, X, Y, and Z are the centroids of triangles ABC, BCD, CDE, DEF, EFA, and FAB, respectively, 


in convex hexagon ABCDEF. Show that each pair of opposite sides of hexagon UVWXYZ are parallel and equal 
in length. 


13.29 In tetrahedron ABCD, the angle ZBDC isa right angle and the foot of the perpendicular from D to ABC is 
the intersection of the altitudes of AABC. Show that ZADB = ZADC = 90°. (Source: IMO) Hints: 215, 258 


13.30 Harrison wants to construct a hexagon such that the midpoints of its edges have coordinates (0,0), (5,0), 
(6, 2), (4,4), (2,4), and (—2, 3), in no particular order. Can he do this? If so, how? Hints: 232 


13.31 Describe the set of points X that satisfy AX CX =CB. AX, where A, B, and C are three different points. 


13.32 Regular polygon P)P2P3---P, is inscribed in a circle with center C and radius r. Show that for any point 
X, we have 


(Py X)? + (PoX)* + (P3X)* + --- + (P X} = n(r? + CX?). 


13.33 Vertex A of the acute triangle ABC is equidistant from the circumcenter O and the orthocenter H of AABC. 
Determine all possible values for the measure of angle A. (Source: IMO) Hints: 234 


13.34 Three vertices of a regular octahedron are at (2,6, —8), (6,4, —4), and (4,8,0). Find the coordinates of the 
other three vertices of the octahedron. Hints: 114, 43, 173 


13.35 Let a and b be complex numbers. Show that the area of the parallelogram in the complex plane with 
consecutive vertices at the origin, a, and b, is |1 (ab — ab) |. Hints: 167, 196 


13.36x In tetrahedron ABCD, we have AB = CD =6, AC = BD = 8, and AD = BC. Let X be the centroid of ABCD 
and Y be the centroid of AABC. Find AD if AX 1 DY. Hints: 251, 197 


13.37x A fixed point P is given inside a sphere with center O. Three mutually perpendicular rays (meaning each 
pair of rays is perpendicular) from P intersect the sphere at points U, V, and W. Point Q is the vertex diagonally 
opposite to P in the parallelepiped with edges PU, PV, and PW. Show that OQ is the same for all possible choices 
of the three mutually perpendicular rays from P. (Source: IMO) 


13.38» WY and XZ intersect at O. Points G; and G; are the centroids of triangles WXO and YZO, and points H; 
and H; are the orthocenters of XYO and ZWO. Show that G;G2 1 HHz. Hints: 247, 13, 28, 200 


13.39% The altitudes of tetrahedron ABCD are extended externally beyond A, B, C, and D to points E, F, G, and 
H, respectively, such that AE = k/ha, BF = k/h, CG = k/h,, and HD = k/hg, where k is a constant and h, denotes 
the length of the altitude of ABCD from vertex A, and likewise for hp, he and hg. Prove that the centroid of EFGH 
is also the centroid of ABCD.(Source: Canada) Hints: 100, 140 


13.40x The angles of convex hexagon ABCDEF are equal. Prove that |AB - DE| = |BC — EF| = |CD - FAI. 
Hints: 180, 42 


13.41* Let D, E, and F be on BC, AC, and AB, respectively, such that AD, BE, and CF be the angle bisectors of 
AABC. If ZEDF = 90°, then find all possible values of ZBAC. (Source: USAMO) Hints: 201, 76 


13.42x Triangle ABC is isosceles with AB = AC and BC = 1. Don AB such that AD = 1 and ZDAC = 77/7. Find 
CD. Hints: 154 


a aaaea 
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Hints to Selected Problems 


. Complete the square. (That means write the quadratic in the form a(x — h}? + k for some constants a, h, and k.) 


. All five choices are nonreal, so one of the factors of the polynomial is a quadratic with nonreal roots. Which 


of the given choices can be a root of a monic quadratic with integer coefficients? 


. Yes, you'll need a lot of algebra. Two factorizations that will be helpful are x? + 2xy + y? = (x + y} and 


xX- y* =(x-y)(xty). 


. What vector tool involves cosines? - 


. In solving the first part, you should have found equations for p and q in terms of s and t (and possibly in 


terms of some constants you defined). Prove that you can solve these equations for s and t in terms of p and 


q. 


; Let v = aì + bj + ck and F = si + tj + uk. Find s, t, and u in terms of a, b, and c. 


. Find other angles in terms of 0. A lot of triangles share side lengths in this problem. What triangle law might 


you use to take advantage of this in order to create an equation for 6? 


. Draw a diagram for the situation after B has rotated 0 counterclockwise about A. By what angle has the red 


spot rotated about the center of 8? 


. Let AC = BC = 1. Find other sides in terms of the angles. Keep going until you discover something interesting. 


. If the hole is drilled along the z-axis, then are there any restrictions on @ when describing red points that 


remain? Are there restrictions on ġ? 


. How can you get information about squares from the given equation? 


. Each term is a power of 2z. 


. To deal with hz, draw a couple altitudes in AWOZ (hz is the orthocenter of this triangle), and look for similar 


triangles. 


. Note that w is a primitive cube root of unity. So, what quadratic has w as a root? Use this to find an expression 


for œw. 
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15. 
16. 
17. 
18. 
19. 


What choice of origin will greatly simplify the right side of the given equation? 
We learned a tool in this chapter for dealing with cos n@ and sin n8. 

Let the center of the square be the origin and let k be the x-axis. 

A quadrilateral can be cut into two triangles. 


You (should have) proved similar identities as Exercises. Are any of them helpful here? 


. Lett = tan 20°. Write all three tangents in terms of t. 
. Write cos? B + cos? C in terms of cos 2B and cos 2C. Don’t forget that B + C = 180° — A! 
. What does |z| = 1 tell us about the location of z in the complex plane? 


. Let 8 be the angle that OX makes with the positive x-axis. What point has the same x-coordinate as A? What 


point has the same y-coordinate as A? 


. The angle you found in the first hint isn’t the only angle with that tangent! Tangent has a period of 7. 


. Note that cos76 = 1. Write cos 76 in terms of cos 0. Let x = cos 0. You should have a degree 7 polynomial. 


This polynomial must have 7 roots. Guess what they are! 


. Let a be the sum of the first two inverse cotangents and let f be the sum of the second two. Now does the 


problem look more manageable? 


. Let R be the radius of the Earth. Find VC, CO, and CD in terms of R. 


. Note that AWOZ and AXOY have an angle in common. So, relating h2 to this angle may help you relate h; to 


hy. 


. Time to take the training wheels off. You’re on your own on this one! 
. Use trig identities to express the ratio of tangents in terms of sin A and sin B. 
. Put the problem on the unit circle. 


. Because v and w are in a plane generated by a and b, both v and w are linear combinations of a and b. Write 


equations for v and w using this fact. 


. Under what conditions does a quadratic have real roots? 

. Let c = cos @. Find a quadratic in z that has coefficients in terms of c. 
. Think geometrically. What points satisfy |z, — 5 + 3i| < 4? 

. Letting e = a x b might help you see the first step better. 

. How can we relate sine to an exponential? 

. How isa real number related to its conjugate? 

. Factor the first two terms and the last two terms. 

. What is the distance between x + xi and i? 


. Suppose we have Mu; = v; fori = 1,2,3. If v1, v2, and v3 are linearly independent, then must u4, uz, and us 


be linearly independent? 


. Try complex numbers. Assign variables to the side lengths. Find b — a, c — b, d — c, etc., in terms of the side 


lengths and C, where Ç is a primitive sixth root of unity. 
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43. 


54. 


g5 


56. 


57. 
58. 


59, 
60. 
61. 
62. 
63. 
64. 
65. 
66. 


67. 
68. 


Can you find the center of the octahedron? 


. When you need to get rid of sines and cosines, consider the identity sin? 0 + cos? 6 = 1. 


- Let v be the vector from a point on the line to (4, —2). Onto what vector can we project v to get a vector whose 


length is the desired distance? 


. Let O be the origin, and let U, V, and W be points such that U = u, V = v, and W = w. How are OV and VU 


related? 


. Two useful tactics with quadratics are factoring and completing the square. Try them both! 

. This is the “polar coordinates” section!! 

. Let x = AC and 0 = CC. Find other angles and lengths in terms of x and 9. Try to build an equation for x. 

. Can you find t in terms of x and y? 

. Under what conditions is there a vector w besides 0 such that Mw = 0? 

. Tf sin t + i cos t were replaced with something of the form cos 0 + isin 0, you could use de Moivre’s Theorem. 


. What graph is produced when the graph of y = f(x) is shifted 6 to the right, and then the result scaled 


horizontally by a factor of 1/2? 


When it comes to matrix multiplication, what matrix plays the role that 1 plays when multiplying real 
numbers? 


If sin O = sing, then how are @ and ọ related on the unit circle! (Note: they don’t necessarily have to differ 
by a multiple of 27!) 


Write both sides in terms of complex numbers. There is a sum on the greater side of the Triangle Inequality. 
(Don’t be afraid of a little algebra!) 


What’s a good guess for the point on MN at which the lines meet? 


After writing the sum in a concise form, can you write P(x) as the product of two polynomials whose roots 
you know? 


Sometimes area can be a useful problem solving tool in problems that don’t even appear to be about area. 
Note that z = 0 if and only if |z| = 0. 

What is zz if the center of the circle is the origin and the radius of the circle is 1? 

Where do the graphs of f and g cross the x-axis? 


What angle equals ZHBX? 

Draw a very precise diagram. How does Cx appear to be related to VM? 
What trig identity does the left side resemble? 

ee ? 
What is A in terms of r and 0? 
Set the desired expression equal to x. 


Your trig identity toolbox will get a lot of work on this one! For starters, sin’ O + cos? @ = 1 isn’t the only 
identity that has cos? 6. 
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69. 


70. 
Tay 
72. 
Te 


74. 
79. 


76. 
V7. 


78. 


90. 


91. 


What do you know about wwr? What can you multiply wwz + wzw; + w3w1 by to introduce expressions you 
can evaluate? 


How is the magnitude of a complex number related to the conjugate of the number? 
Do you see any cases in which the determinant equals 0? What factors do these cases suggest? 
Notice that 40° and 80° are 20° from 60°. 


What is x f(x)? Can you use the result to find a simpler expression for f(x)? How is that expression related 
to z? 


The left side is a sum of angles. When working with complex numbers, when do we end up adding angles? 


What are the possible values of det(A)? 


Let A be the origin. Find an expression for B.C. 


You're probably stuck dealing with ee? + ee” + ee”. If so, what can you multiply the given equation by to 
get something that looks almost like ee? + ee” + efe”? And how is the result of that multiplication related 
to ee! + el el” + eB ely? 


Build right triangles. Lots of them! What do you know about the acute angles that have the center of the 
circle as the vertex? 


. Where else in math have you seen the sum of the squares of two real numbers set equal to 1? 
. Let the points be A, B, C, respectively. By what angle must we rotate B about A to get C? 
. What vector is parallel to B, By? 


. In part (a), we proved an equation involving cos 2A. In this part, we have a statement about cos* A. How can 


we relate the two? 


. Suppose the calculator evaluates sine in radians, and that the professor entered a measure t that was in 


degrees. What should the professor have entered instead of simply “t”? 


. sin? 0 + cos? 0 = 1 


. If you're stuck with a ratio that you’d like to show is imaginary, here’s a hint. If = = 3 = k, then what does 


24 
21b+23¢ 9 
eee ee equal? 


rary atl on, oe le og te E ait oe Pe 
> Thie senie i tise T pa tac. + To. telescopes, because Tn 


. If you're stuck dealing with wz + zw, try computing this expression for a few choices of w and z. Notice 


anything interesting? 


. Let S = w + 2w? + 3w +--- + 9w. What is wS? 


. Once you start dealing with the fourth powers, the algebra gets a little hairy. But don’t forget what w? +w +1 


is when w is a primitive cube root of unity! 


Geometry: Start with a regular pentagon. Let the side length of the pentagon be 2x. Add diagonals and drop 
altitudes to build some 18-72-90 right triangles. 


Find the possible values of p first. Then, for each possible value, think about where the graph crosses the 
x-axis. 
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92. No luck with the first hint? Let’s try another approach. Let x = BD and try to build an equation for x. You'll 
have to use some trig, of course! 


93. Compare |w + z|? to (|| + |z|). Remember, |z = z - Z. 


94. What is —? 


x+yi” 
95. Let the first four lines meet at the origin. 


96. Let the vertices of the triangle be the cube roots of unity. Start with PA”, PB*, and PC?. Letting p correspond 
to P, write the sum of these squares in terms of p and P. 


97. Start with an easier problem; let u = i and v =j. 


98. Think about the problem geometrically. In each part, consider what the given conditions tell us about the 
possible locations of the head of b. 


99. Sines and cosines, sines and cosines. 
100. Find a vector in terms of B, Č, and D that is in the same direction as the altitude through A. 
101. Build an equilateral triangle. 
102. How is a real number related to its conjugate? 
103. Can you pair up the angles in the numerator in a convenient way? 


104. Try the Angle Bisector Theorem. 
105. How can you relate A to BB? 


106. Let WXYZ be a square. What is the length of the projection onto XY of the vector from a point on side WX 
to a point on YZ? 


107. How would you parameterize the whole circle? What restrictions do you need in order to omit the part of 
the circle that you don’t want? 


108. You could get a closer look at this with a telescope. 
109. What is |z;|? 


110. You have two right triangles (BAM and BDM) with related angles, and you have some side lengths. Trig 
away! Try to build an equation for 0. 


111. The less fun way: Let cı and cz be the two possibilities for c. Find expressions for c1 + c2 and c;c2 in terms of 
a and b. What quadratic has c; and cz as roots? 


112. Find det(A) in terms of vı, v2, 0, and ||wl| / |lv|]. When is this determinant negative? 
113. What does z% — z8 = 1 tell us about the relationship between z” and z? in the complex plane? 


114. Suppose an octahedron has edge length x. What are the possible lengths of a segment connecting two vertices 
of the octahedron in terms of x? 


115. Write the system of equations using matrices and vectors. Do you recognize the matrix? 


116. What is the inverse of A? 


117. When finding a parameterization for the red point, focus first on its position relative to the center of B. 
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118. For the extra challenge, think of a pair of transformations such that the order in which you apply the 
transformations doesn’t affect the final result if both are applied. 


119. What is A? ic ? 


120. Let P be the foot of the altitude from Z to BC. Use the conditions that FÈ IL BC and that P is on BC to write 
two equations. Solve these equations for p. 


121. Let z = a + bi, and use the definition |z| = Va? + b?. For another approach, what is the graph of the equation 
iz — 1| = |z + 3| on the complex plane? 


122. Can sin? 6 ever be larger than sin? 0? 
123. All of the entries in the matrix are even. 
124. What complex number has argument arctan }? 


125. Use the described property of the function to write an equation in terms of a, b, and z. Remember, this 
equation must hold for all z. 


126. Algebra: Let x = cos 36° and y = cos 72° = sin 18°. Use trig identities to relate x and y. 

127. Let A be the origin. What must be true about (m — n)/(n’ — b’) if and only if MN + B’N’? 

128. Eliminate the denominators! 

129. Two 60° angles; it sure would be nice if there were a third. 

130. Yes, the algebra gets a little nasty on this problem. Don’t give up too soon, and stay organized. 

131. Choose several values of x to come up with a guess for the period. 

132. What is (a + bi)(c + di)? 

133. Find an expression for AB/BC in terms of 6. The first hint should be helpful for this. 

134. Let A be the foot of the altitude from (4, —2) to the line. How is the vector from A to (4, —2) related to the line? 
135. Any three vectors in R? are linearly dependent. 

136. What is the determinant of A”? 

137. You can’t find the actual solutions without a calculator, but you can find sums of pairs of the solutions. 
138. If you know what proj, w is, can you figure out what w is? 

139. This problem would be a whole lot easier if v or w were simply 1. 

140. The vector DC x DB is normal to what plane? 

141. Show that each root of x” — 1 is a root of exactly one y,(x), and vice versa. 

142. Recall that x"! + x"? + ---+x4+1=(x-w)(x —w?)---(x —w""!), where w = &2%/", 


143. Let z be a complex number (point in the complex plane), and let f(z) be its image. How can we write the 
distance between these two points as an algebraic expression? 


144. What is the signed area of the parallelogram with Ti and Tj as sides? 


145. Make a clever substitution to take care of the 4” term. 
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146. 
147. 
148. 
149. 


150. 


151. 
152. 


153. 
154. 
Ss: 


156. 
157. 


158. 


159. 


160. 


161. 
162. 
163. 
164. 
165. 


166. 
167. 
168. 


169. 
170. 


Find g`! (x) in terms of a, b, c, and d. 
What identity do you know that involves tangent and secant? 
Find a parameterization for the path of the center of B. 


Let Bı and By be the points on k; and kz, respectively, such that B; By is the shortest distance between the two 
lines. How is B;B> related to the two lines? 


Create equations involving p and p with the two conditions given for p: it is on the line through a and b, and 
the line through p and z is perpendicular to the line through a and b. 


How can you interpret the problem geometrically? 


Just for a moment, let a = a, b = B, c = ay, and d = b, in the right sides of the equations for ay41 and by41. 
Where have you seen expressions like that before when working with complex numbers? 


What is tan(a + b + c)? 
m/7 is a fourteenth root of unity. Put on your complex number algebra hat. 


The expression for f(x) is almost a geometric series. What strategy do you use to prove the formula for the 
sum of a geometric series? 


What must x be to make -#4 equal sec? 6? 


Geometry: Convert 18° to radians. What geometric figure gives you angles in radians with factors of 5 in the 
denominators? 


Replace 30 with another variable. What points on the unit circle are terminal points of angles that satisfy the 
resulting inequality? 


Sines and cosines are usually easier to deal with than cotangents. 


Let A = ie: a , so that v and w are the columns of A. Express the components of w in terms of ||wIl, ||v|l, 
2 W2 


and 0, where @ is the smallest counterclockwise rotation that must be applied to v to make the result have 
the same direction as w. 


Put the rectangle on the complex plane with one vertex at the origin and two sides on axes. 
Can you write a trigonometric equation that has 0 = 27/7, 0 = 47/7, and 0 = 67/7 as solutions? 
You know how to find the cosine of the angle between two vectors. 

What is true about the diagonals of a parallelogram? 


The series is almost a geometric series. What strategy do you use to prove the formula for the sum of a 
geometric series? 


Remember that y is constant. Can you write the expression in terms of only y? 


Complex numbers are vectors. 


Lets = sin @ + cos @ and p = (sin @)(cos 0). Find two equations relating s and p, one with trig identities, and 
the other with the given equation. 


Find two expressions for BD”. Use them to find an expression of a* — b? — c? +d’. 


After moving all the terms to one side, what type of series do you have? 
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171. The cross-section given in the problem is not the only useful cross-section. Consider the cross-section that is 
perpendicular to the Earth’s axis and includes Vancouver. Let A and B be the points on the circumference 
of this cross-section that divide the cross-section into “night” and “day.” Where are points C and D in this 
cross-section? 


172. Let Py be w* and let z correspond to z. Express PP? in terms of z, Z, and w. 
173. The space diagonals of an octahedron are perpendicular. 


174. Suppose ki is one of the imaginary roots. Let x = ki in the equation. What must both the resulting real and 
imaginary parts equal? 


175. Build triangles, and remember that equal angles have equal cosines. 


176. The angle bisectors of a triangle meet at the incircle of the triangle. Build a right triangle with an angle that 
has measure 4 


Gp . 
177. Take it in steps. How would you parameterize the circle if it were centered at the origin? 
178. The left side is the cube of a product. What product? And what sum can you write that product as? 


179. When we say multiple, we mean multiple! As in, all of them. All 6 of them. Then, start from the relationships 
that involve the side you want (BE), and try to work as much as possible with sides you know and angles 
with equal sines. 


180. You can find the angles of the hexagon. You then know how much you have to rotate each side in order to 
match the direction of an adjacent side. What tool works well with rotations? 


181. If vi, v2, and v3 are linearly independent vectors in R?, then any vector in R? can be written as a linear 
combination of v;, v2, and v3. 


182. Start with a polynomial that has all 12 twelfth roots of unity as roots. 


183. You answer to first hint should give you expressions of the form e — e~. We'd rather deal with expressions 
of the form 1 — e“, since we've seen these sorts of expressions before. (Where?) 


184. What is f(t)? 
185. We have f(f(x)) = x for all x. Choose a convenient x. 
186. Start with two triangles, AABD and ABCD, as in the first part. Notice that these triangles share a side. 


187. Draw a diagram that has a right triangle with one side of length sin & and hypotenuse of length 1. Find an 
angle in the diagram that has cosine equal to sin &. 


188. Show that each root of ®2,,(x) is a root of ®,(—x), and vice versa. What are the negatives of the roots of ®,,(—x)? 


189. Try using a few different identities to rewrite the each of the first three or four terms in the sum. Do any of 
your methods allow you to combine terms conveniently? 


190. Consider the cubic polynomial with roots e’*, e”, and e”. Expand it—what do you know about the resulting 
coefficients? 


191. You probably don’t need a hint to try complex numbers. But you might need a hint to deal with the expression 
you'll get for O;O>. Here are a couple: if w = e™/13, then what is w! and what power of w equals wk? 


192. What complex number has argument arctan 2? 


193. Buy low and sell high!!! How many times can you do so? 
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194. In what other area formula do we have the product of two side lengths? In what other triangle formula do 
we have the circumradius? 


195. Try turning each product into a sum. 
196. We have a triangle area formula that involves sine. We also found a vector formula that involves sine. 


197. Let t be the desired length. You should be able to find all the dot products in your equation, or to express 
them in terms of t. 


198. Can you find the area of the triangle with the three given points as vertices? 


199. Let AB = bii + Dj + bsk, AC = cii + coj + c3k, and AD = dii + d2j + dk. For each part, what determinant (in 
terms of the components of these vectors), equals the desired volume of the described parallelepiped? 


200. Show that hz = i(w — z) cot ZWOZ. 
201. As an Exercise, you should have learned how to deal with angle bisectors of a triangle with vectors. 
202. What transformation has no final effect if applied three times? 


203. A + B + C = 180°. Expressions with two variables are often easier to work with than expressions with three 
variables! 


204. The professor entered t, and should have entered the expression you found in the first hint. And yet the 
professor got the right answer! So, what equation does that give you? 


205. Find the angles at P. Then what equations can you write? 


206. Experiment! Compute a few of the z;. Notice anything interesting? 


207. The expression Je + (x- 1? looks like the distance formula. 
208. What trig identity does the right side of the equation resemble? 
209. That sum can be written in a much more concise form! 

210. Show that f(w) = proj, w is a linear function. 

211. How can you relate sines and cosines to exponentials? 


212. What identity does the expression (x — y)/(1 + xy) resemble? What substitution might you make for x and y 
to recast this problem as a problem involving angles instead of one involving real numbers? 


213. If det(A — AI) £ 0, then what vectors v satisfy the equation (A — AI)v = 0? 
214. Find a convenient way to pair up the terms. 
215. What choice of origin allows you to take advantage of all those right angles? 


216. Letb, = a,/2". Write the recursion in terms of b;,4.1 and by. What substitution for b, does the resulting recursion 
suggest? 


217. How are the x; + yzi related to the tenth roots of unity? 


218. Focus on the two circles that produce Z3. Use similar triangles to express z3 in terms of the radii of the circles 
and the complex numbers corresponding the centers of the circles. 


219. Why must ZSOC = 66.5° on the longest day of the year in Vancouver? (If you can’t figure this out, assume it 
is true and try to finish the problem from here.) 
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220. Any point on an angle bisector of an angle is equidistant from the sides of the angle. 
221. Express A? in terms of A, I, and entries in A. 


222. Forget about matrices for a minute. What pairs of integers (a, b) satisfy ab = a + b? Try to find a solution that 
doesn’t involve division; there’s no “division” in matrices! 


223. Product of terms. Double angles. Which identity should you try to use? 


224. How are ZAOB and ZACB related in an acute triangle ABC? A right triangle with ZC = 90°? An obtuse 
triangle with ZC > 90°? 


225. Here are two first steps you might consider for turning the equation into something simpler you can handle: 
(1) Write the equation in terms of sine and cosine, (2) View the equation as a quadratic in tanx. Again, 
remember that you don’t have to find x! 


226. Thea? +b? suggests complex numbers. Let z = a+bi. Can you write an equation in terms of z that is equivalent 
to the given system? 


227. How can you factor x” — 1 as the product of n polynomials? How can you factor x" — 1 as the product of 2 
polynomials? 


228. Let PB = x. Can you find PA’ in terms of x? 

229. Two angles that are inscribed in the same arc of a circle are equal. 
230. What is A”? 

231. How is v — F related to the plane? 


232. Let the hexagon be ABCDEF. Write equations involving vectors to the vertices. Can you group the equations 
conveniently? 


233. The more fun way: What cubic has roots a, b, and c? What simple cubic has roots that are the vertices of an 
equilateral triangle? 


234. Find ZBOC first. 


235. After solving the problem in which the hole is drilled along the z-axis, how can you quickly change your 
result to account for a hole being along the x-axis? 


236. Where have you seen an expression of the form (x — w)(x — w”)(x - w°)? 
237. Draw a picture. 
238. Find other angles in terms of 0. Did you discover anything interesting? 


239. After you havea guess that the period is p, use the unit circle to show that we must always have f(x+p) = f(x). 
(Focus on sine and cosine separately, and don’t forget to show that no smaller period is possible.) 


240. Solve the first equation for y, the second for z, and the third for x. What trig identity resembles the resulting 
equations? 


241. There are three pairs of angles in the problem with equal sines. Try applying the Law of Sines to multiple 
triangles, focusing on sides you know and these pairs of angles. 


242. Find the volume of the parallelepiped with AB, AC, and AD as edges. Try to relate the desired volumes to 
this volume. 
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243. You'll need to be able to express the center of an equilateral triangle on the complex plane in terms of its 


vertices. The midpoint of a segment is the average of its endpoints. What do you think the center of an 
equilateral triangle is? 


244. First, consider the problem in which the hole is drilled along the z-axis. 
245. Notice anything interesting about that particular matrix? 


246. The equation isn’t in the form of an ellipse with a horizontal major axis. Maybe it’s the result of rotating an 
ellipse with horizontal major axis about the origin? 


247. Try complex numbers. Show that (h — h)/(g2 — 91) is imaginary. 
248. The Law of Cosines is not the only way to introduce cosine; what’s sin 26? 


249. Focus on the four key points: the two ends of the graph and the two points where the graph changes direction. 
Where are these points on the new graph? 


250. Because you're looking back here, you've discovered that this problem is a lot harder than it looks! Find lots 
of angles, and keep your eye out for supplementary angles. What trig law can we sometimes use in problems 
with convenient supplementary angles? 


251. Let A be the origin. Write an equation for the given perpendicularity. 
252. The seventh roots of unity are the roots of what polynomial? 
253. Note that x and y can each be any real number. So can the tangent of an angle. 


254. If x? +x +1 = 0, then how can you write (x + 1)” as a power of x? 


255. The expression ia should be familiar—geometrically speaking, how is this expression related to a and v? 
a 


256. There’s a reason this part comes after part (a)! 
257. The equation is just a quadratic; you can find z! 
258. There are a lot of right angles at D. 


259. Many 3D triangles are 2D problems in disguise. What does the problem look like if you unroll the curved 
surface of the cone? 


260. Break the quadrilateral into two triangles, AABD and ABCD. How are sin A and sinC related? And how can 
you get these from cos A? 


261. Sine and cosine are often easier to deal with than tangent. 
262. Find two quadratics whose product is the given quartic. 


263. Use wishful thinking. Can you change a single term in the polynomial into something else to make the 
polynomial into one you can handle? What's the difference between the polynomial you are given and the 
one you can handle? 


264. For each of the first four lines, write an equation that means that the origin is on the line. Be careful to write 
them so that you can use convenient symmetry in your equations! 


265. Creatively add segments to AABC to create an angle equal to 4B — Ze: 


266. Connect all three points to the center. 
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267. 


268. 
269. 
270. 
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275. 


276. 
277, 
278. 
279. 
280. 
281. 


282. 
283. 
284. 


285. 


286. 


287. 
288. 


Two of the terms in the equation are reciprocals. What is the smallest possible value of the sum of a positive 
number and its reciprocal? 


Write the equation for the line in terms of w, then let w = a + bi. 
Let A be the origin and find every point you can in terms of band c. 


Focus on linearly dependent vectors first, since that gives you an equation to work with: ap + bq = 0 for 
some constants a and b that are not both 0. 


Try some specific examples, such as f(x) = x° or f(x) = x +3. 
The left sides look like the Binomial Theorem. 
Cosines and vectors. What tool should you consider? 


Why must there be scalars p1, 91, p2, 2 such that v = pja + qib and w = pra + q2b? 


Let X be the intersection of the lines containing the altitudes from C and E. Suppose AX -FB = k. What do we 
wish to prove k equals? 


Suppose the difference in arguments between v and w is a. Show that |v + w| = |1 + er 
How are ab and a + b related to (a — 1)(b — 1)? 

Area. 

Draw altitude BY. What angle of the triangle equals ZAHY? 

We have two cosines; try building a right triangle. 


Compare this equation to the condition we proved in the text under which a, b, and c are the vertices of an 
equilateral triangle in the complex plane. What must be true about w and C if both equations are true? 


Use trig identities to express the right side as the tangent of an angle. 
All the variable terms are powers of z’. If you let t = z°, do you get a polynomial you recognize? 


Choose your origin wisely. Specifically, choose it in a way that wipes out a lot of the terms in the expression 
you must show equals 0 in order to show that z1, z2, and zg are collinear. : 


It’s the last problem of the book—you’re on your own! 


Your expression for [ABCD] from the first hint should look vaguely similar to your expression for a? — b? — 
c? + d*. One should have sines where the other has cosines. One way to make sines and cosines go away is 
sin? 0 + cos? @ = 1. 


Compare the expression the Shoelace Theorem gives for n = k to the result when n = k +1. 


What is |w + z? — ((wl + |z|)*? 
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30-60-90 triangle, 32 
45-45-90 triangle, 32 


Agnesi, Maria Gaetana, 191 
Airy, Sir George, 180 
AMC, vii 
American Mathematics Competitions, see AMC 
American Regions Math League, see ARML 
amplitude, 56 
angle 
dihedral, 155 
inscribed, 141 
Angle Bisector Theorem, 155 
angle difference identity, 91-99 
angle of depression, 129, 130 
angle of elevation, 129, 130 
angle sum identity, 91-99 
angular coordinate, 167 
arccos, 68 


arccot, 68 

arccsc, 68 

arcsec, 68 

arcsin, 68 

arctan, 68 

argument, 220 

ARML, vii 

Art of Problem Solving, v, 512 

associative property 
complex numbers, 195 

asymptote, 50 


bilinear, 308, 383 
box product, 451 
Brahmagupta’s formula, 157 
Bretschneider’s formula, 157 


Cartesian space, 177 
Cauchy-Schwarz Inequality, 311 
centroid, 272, 470 

of a tetrahedron, 475 

vector representation, 470 
Chebyshev polynomial, 224 
circle 

complex number representation, 201, 274 
circle of Apollonius, 275 
circular reasoning, 231 
circumcircle, 144, 148 
circumradius, 144, 148, 474 
cis, 220 
cofactor, 405 
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collinearity, 266, 270 
complex numbers and, 266, 270 
column vector, 343, 409 
commutative property 
complex numbers, 195 
complex number, 192-297 
argument, 220 
arithmetic, 192-197 
associative property, 195 
commutative property, 195 
complex plane, 198-202 
conjugate, 195 
distributive property, 195 
equilateral triangle, 280 
exponential form, 225-233 
geometry, 256-297 
graphing, 198-202 
imaginary part, 192, 202-208 
magnitude, 200, 220 
multiplicative inverse, 196 
norm, 200 
parallelogram law, 217 
polar form, 219-225 
real part, 192, 202-208 
rectangular form, 220 
root of unity, 233-249 
square root, 193, 206 
complex number geometry, 256-297 
circle, 201, 274 
collinearity, 266, 270 
midpoint, 264 
parallel lines, 267, 270 
perpendicular lines, 268, 270 
regular polygon, 278-286 
complex plane, 198-202 
component, 298 
composition (of functions), 16-19 
conjugate, 195 
conjugate pair, 195 
converse, 141 
conversion factor, 41 
coordinates 
cylindrical, 178-187 
polar, 167-176 
rectangular, 178 
spherical, 179, 187 
cos, 30 
cos7!, 68 
cosecant, 30 
graph of, 52 
cosine, 30 
angle difference identity, 95 


angle sum identity, 95 
difference-to-product identity, 109 
double angle identity, 101 
graph of, 48 
half angle identity, 104 
in terms of e, 231 
period of, 47 
sum-to-product identity, 109 
triple angle identity, 106 

cot, 30 

cotangent, 30 

Cramer’s rule, 357, 397 

cross product, 442-449 

csc, 30 

cyclic quadrilateral, 157 

cycloid, 165 

cyclotomic polynomial, 253 

cylindrical coordinates, 178-187 


de Moivre’s Theorem, 223, 227 
degree, 212 
determinant, 357 
geometric interpretation (2 x 2), 364-369 
geometric interpretation (3 x 3), 450-454 
of a2 x 2 matrix, 355-369 
of a 3 x 3 matrix, 395—408, 450-454 
diagonal matrix, 348 
difference-to-product identity, 106-110 
dihedral angle, 155 
dilation, 257 
center of, 257 
scale factor, 257 
dimensionless unit, 40 
direction vector, 316, 423 
distributive property 
complex numbers, 195 
domain, 1 
dot product 
in three dimensions, 382-384 
in two dimensions, 305-312 
double angle identity, 100-106 
double root, 217 
dummy variable, 1 


e, 227-228 
Edwards, Miles Dillon, 484 
eigenvalue, 419 
eigenvector, 419 
ellipse, 175 
equation 

parametric, 158-167 
equator, 180 
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equilateral triangle 

complex numbers and, 280 
Euler line, 471, 472 
even function, 50 
excircle, 474 
expansion by minors, 404 
exponential form, 225-233 
Extended Law of Sines, 144 
extraneous solution, 85 


f~! (functions), 20 
Factor Theorem, 209 
Feuerbach’s Theorem, 474 
fractals, xiv 
frequency, 61, 111 
function, 1-28 
composition, 16-19 
domain, 1 
even, 50 
graphing, 9-16 
inverse, 19-24 
invertible, 22 
linear, 308, 311 
odd, 50 
periodic, 46 
range, 1 
real, 2 
real-valued, 2 
transformation, 12-16 
trigonometric, 29 
Fundamental Theorem of Algebra, 209 


generalization, 138 
generated, 426, 427 
geometric transformation, 256 
complex numbers and, 256-265 
geometry 
complex number, 256-297 
great circle, 180 
Greek letter, 29 
guitar, 111 


half angle identity, 100-106 
Harvard-MIT Math Tournament, see HMMT 
head, of a vector, 303 
head-to-tail addition, 303 
Heron’s formula, 152 
proof with complex numbers, 485 
HMMtT, viii 
horizontal line test, 23 
hyperbolic cosine, 252 
hyperbolic sine, 252 


hyperbolic tangent, 252 
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identity (trigonometric), 38, 83-127 
angle difference, 91-99 
angle sum, 91-99 
difference-to-product, 106-110 
double angle, 100-106 
half angle, 100-106 
product-to-difference, 106-110 
product-to-sum, 106-110 
sum-to-product, 106-110 
triple angle, 106 

identity matrix, 350, 393 
2 x 2, 350 
3 x 3, 393 

if and only if, 204 

i295 

Im, 202 

imaginary axis, 198 

imaginary number, 192 

imaginary part, 192, 202-208 

incircle, 148, 474 

inner product, 306 

inradius, 148, 474 

inscribed angle, 141 

instrument tuning, 111 

intercept, 9 

intersection, of intervals, 6 

interval 
intersection, 6 
union, 6 

interval notation, 2 

inverse cosine, 68 

inverse function, 19-24 
graph of, 23 

inverse matrix (2 X 2), 369-375 

inverse matrix (3 x 3), 454-457 

inverse sine, 68 

inverse tangent, 68 

inverse trigonometric function, 64-76 

invertible, 372 

irreducible polynomial, 242 

isometry, 257 


Jacobi Identity, 449 
kite, 469 


Lagrange Identity, 449 
latitude, 180 

lattice point, 369 

Law of Cosines, 136-141 
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Law of Sines, 141-148 
Law of Tangents, 152 
length 
of a vector, 381 
line 
definition of, two dimensions, 316 , 
distance from a point to, 330 
linear combination, 319, 421 
linear function, 308, 311 
linearly dependent, 322, 441 
linearly independent, 322, 441 
links, vi 
logarithm 
base e, 255 
longitude, 180 


magnitude, 220 
of a complex number, 200 
of a vector, 381 
Mandelbrot Competition, vii 
Mandelbrot set, xv 
matrices, 335 
matrix 
2 x 2, 335-379 
3 x 3, 384-408 
addition, 338 
cofactor, 405 
determinant, 355-369, 395-408, 450-454 
diagonal, 348 
dimensions, 409 
eigenvalue, 419 
eigenvector, 419 
identity, 350, 393 
inverse (2 x 2), 369-375 
inverse (3 x 3), 454-457 
multiplication, 342 
nonsingular, 362 
permutation, 394 
rotation, 352 
singular, 362 
skew-symmetric, 419 
square, 414 
transpose, 378, 399 
Zero, 336 
median, 272 
of a tetrahedron, 475 
midpoint 
in the complex plane, 264 
minor, 404 
Monge’s Theorem, 289 
monic, 212 
multiplicative inverse, 196 


New York State Math League, viii 
nine-point circle, 471 
complex numbers, 479 
nonreal, 192 
nonsingular matrix, 362, 397 
norm 
in three dimensions, 381 
in two dimensions, 300 
of a complex number, 200 
normal, 323, 438 
number 
complex, 192 
imaginary, 192 
nonreal, 192 
real, 192 
NYSML, viii 


odd function, 50 

orientation, 256 

origin, 198 

orthocenter, 471 
complex numbers, 479 
vector representation, 471 

orthogonal, 310, 384 

Outer Napoleon Triangle, 288 


parallel lines 

complex numbers and, 267, 270 
parallel planes, 427 
parallelepiped, 450 
parallelogram 

area of, 449 
parallelogram law, 217 
parameter, 158 
parameterization, 163 
parameterize an equation, 164 
parametric equation, 158-167 
period, 46 
periodic function, 46 
permutation matrix, 394 
perpendicular lines 

complex numbers and, 268, 270 
phase shift, 58 
Pigeonhole Principle, 119 
pirate, 263 
pitch, 111 
plane, 427 

normal, 438 

parallel, 427 

through the origin, 426 
polar coordinates, 167-176 
polar form, 219-225 
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polynomial 

complex roots of, 208-214 

cyclotomic, 253 

degree, 212 

Factor Theorem, 209 

irreducible, 242 

monic, 212 

Rational Root Theorem, 209 
prime meridian, 180 
primitive root of unity, 238 
problem solving, iii 
product-to-difference identity, 106-110 
product-to-sum identity, 106-110 
proj, 323, 328 
projection, 323-331 
prosthaphaeresis, 124 
Ptolemy’s Inequality, 287 
Pythagorean Theorem, 39 


quadrant, 35 


R, 2 
R\, 6 
radial coordinate, 167 
radian, 40-44 
range, 1 
Rational Root Theorem, 209 
Re, 202 
real axis, 198 
real function, 2 
real number, 192 
real part, 192, 202-208 
rectangular coordinates 
three dimensions, 178 
two dimensions, 167 
rectangular form, 220 
reflection, 257 
regular polygon 
complex numbers and, 278-286 
resources, V 
right triangle trigonometry, 35, 128-136 
right-hand rule, 445 
root of unity, 233-249 
primitive, 238 
rotation, 256 
angle, 256 
center, 256 
orientation, 256 
rotation matrix, 352 
row vector, 343, 409 
Rule of Sarrus, 397 


scalar, 298 


scalar triple product, 451 

scale factor, 257 

sec, 30 

secant, 30 

semiperimeter, 148 

Shoelace Theorem, 379 

signed area, 367 

signed volume, 451 

Simson Line, 291 

sin, 30 

sin7!, 68 

sine, 30 
angle difference identity, 95 
angle sum identity, 95 
difference-to-product identity, 109 
double angle identity, 101 
graph of, 47 
half angle identity, 104 
in terms of e, 231 
sum-to-product identity, 109 
triple angle identity, 106 

sine curve, 48 

singular matrix, 362, 397 

sinusoid, 48 

sinusoidal variation, 61 

skew-symmetric matrix, 419 

slope, 30, 131 

SOHCAHTOA, 128 

sound, 111 

sound waves, 111 

spherical coordinates, 179, 187 

square matrix, 414 

Stewart’s Theorem, 151 

sum-to-product identity, 106-110 


tail, of a vector, 303 
tan, 30 
tan“!, 68 
tangent, 30 
and slope, 131 
angle difference identity, 95 
angle sum identity, 95 
double angle identity, 101 
graph of, 51 
half angle identity, 104 
Tchebychev polynomial, 224 
telescoping series, 116 
terminal point, 30 
tetrahedron 
centroid, 475 
median, 475 
three-dimensional space, 177 
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transformation, 256 
translation, 256 
transpose, 378, 399, 419 
of a2 x 2 matrix, 378 
of a3 x 3 matrix, 399 
ofan r x c matrix, 419 
triangle area, 133 
Triangle Inequality, 275, 277 
trig, 30 
trigonometric function, 29 
graph of, 45—64 
in a right triangle, 35 
inverse of, 64-76 
trigonometric identity, 38, 83-127 
trigonometry, 29-157 
Extended Law of Sines, 144 
Law of Cosines, 136-141 
Law of Sines, 141-148 
Law of Tangents, 152 
right triangle, 128-136 
triple angle identity, 106 


union, of intervals, 6 

unit circle, 30 

unit vector, 299, 381 

upper triangular matrix, 408, 457 


USA Mathematical Talent Search, see USAMTS 


USAMTS, vii 


variable 
dummy, 1 
variation 
sinusoidal, 61 
vector, 298 


centroid of a tetrahedron, 475 


centroid of a triangle, 470 
column, 343 

component, 298 

cross product, 442—449 
direction, 302, 381 


dot product, 305-312, 382-384 


head, 303 
head-to-tail addition, 303 


in three dimensions, 380-384 


in two dimensions, 298-334 
inner product, 306 

length, 300, 381 

linear dependence, 313-322 
magnitude, 300, 381 

norm, 300, 381 

normal, 323, 438 
orthocenter, 471 
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projection, 323-331 
row, 343 
tail, 303 
unit, 299, 381 
zero, 299 
vertical line test, 12 
Vieta’s formulas, 246 


witch of Agnesi, 191 


x-intercept, 9 
xy-plane, 177 
xz-plane, 177 


y-intercept, 9 
yz-plane, 177 


z, 195 

zero matrix, 336 

zero vector, 299 
zero-product property, 210 
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